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U S DEPARTMENT OF COMMERCE PATENT AND 



TRANSMITTAL LETTER TO THE UNITED STATES 
DESIGNATED/ELECTED OFFICE (DO/EO/US) 
CONCERNING A FILING UNDER 35 U.S.C. 371 



INTERNATIONAL APPLICATION NO. 
PCT/US99/19443 



INTERNATIONAL FILING DATE 
24 August 1999 (24.08.99) 



ATTORNEY'S DOCKET NUMBER 



U.S APPLICATION N 



09/763331 



PRIORITY DATE CLAIMED 
24 August 1998 (24.08.98) 



TITLE OF INVENTION Synthetic Fatty Acid Desaturase Gene for Expression in Plants 



APPLICANT(S) FOR DO/EO/US MARTIN, Charles E.; MITCHELL, Andrew 



Applicant herewith submits to the United States Designated/Elected Office (DO/EO/US) the following items and other information: 
1-13 Tb^s a FIRST submission of items concerning a filing under 35 U.S.C. 371. 

2.Q This is a SECOND or SUBSEQUENT submission of items concerning a filmg under 35 U.S.C. 371. 
3.0 This is an express request to begin national examination procedures (35 U.S.C. 371(f)). The submission must include 
items (5), (6), (9) and (21) indicated below. 

4. r~| The us has been elected by the expiration of 19 months from the priority date (Article 31). 

5. [xl A copy of the International Application as filed (35 U.S.C. 371(c)(2)) 

a. □ is attached hereto (required only if not communicated by the International Bureau), 
r-i b. LH has been communicated by the International Bureau. 

3 c. H is not required, as the application was filed in the United States Receiving Office (RO/US). 
6 j] An English language translation of the International Application as filed (35 U.S.C. 371(c)(2)). 
yi a. n is attached hereto. 

Ivt b. EH has been previously submitted under 35 U.S.C. 154(d)(4). 

Amendments to the claims of the International Aplication under PCT Article 19 (35 U.S.C. 371(c)(3)) 
H a. O are attached hereto (required only if not communicated by the International Bureau). 
5=. h- n have been communicated by the International Bureau. 

c'p c. □ have not been made; however, the time limit for making such amendments has NOT expired. 
iti d. IX] have not been made and will not be made. 

An English language translation of the amendments to the claims under PCT Article 19 (35 U.S.C. 371 (c)(3)). 
I;=Q An oath or declaration of the inventor(s) (35 U.S.C. 371(c)(4)). 
IC. □ An Enghsh lanugage translation of the annexes of the International Preliminary Examination Report under PCT 
Article 36 (35 U.S.C. 371(c)(5)). 

Items 11 to 20 belovt' concern document(s) or information included: 

11. □ An Information Disclosure Statement under 37 CFR 1.97 and 1.98. 

12. n An assignment document for recording. A separate cover sheet in comphance with 37 CFR 3.28 and 3.31 is included. 

13. [Z! ^ FIRST preliminary amendment. 

14. \Z\ A SECOND or SUBSEQUENT preliminary amendment. 

15. Q A substitute specification. 

16. Q A change of power of attorney and/or address letter. 

A computer-readable form of the sequence hsting in accordance with PCT Rule 13ter.2 and 35 U.S.C. 1.821 - 1.825. 

18. |~] A second copy of the published international apphcation under 35 U.S.C. 154(d)(4). 

19. □ A second copy of the English language translaUon of the international apphcation under 35 U.S.C. 154(d)(4). 

20. CH Other items or informaUon: 
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APPLICATION NO 

PCT/US99/19443 



ATTORNEY'S DOCKET NUMBER 

13259-00008 



21. |2y. The fpUowing fees are submitted: 
BASIC NATIONAL FEE (37 CFR 1.492 (a) (1) - (5)): 

Neither international preliminary examination fee (37 CFR 1.482) 
nor international search fee (37 CFR 1.445(a (2)) paid to USPTO 

and International Search Report not prepared by the EPO or JPO $1000.00 

International preliminary examination fee (37 CFR 1.482) not paid to 

USPTO but International Search Report prepared by the EPO or JPO $860.00 

International preliminary examination fee (37 CFR 1.482) not paid to USPTO 

hut international search fee (37 CFR 1.445(a)(2)) paid to USPTO S710.00 

International preliminary examination fee (37 CFR 1.482) paid to USPTO 

but all claims did not satisfy provisions of PCT Article 33(l)-(4) $690.00 

International preliminary examination fee (37 CFR 1.482) paid to USPTO 

and all claims satisfied provisions of PCT Article 33(l)-(4) $100.00 

ENTER APPROPRIATE BASIC FEE AMOUNT = 



CALCULATIONS PTO USE ONLY 



Surcharge of $130.00 for furnishing the oath or declaration later than Q 20 | | 30 
months from the earliest claimed priority date (37 CFR 1.492(e)). 



NUMBER FILED 



NUMBER EXTRA 



Inifpendent claims 



M^TIPLE DEPENDENT CLAIM(S) (if applicable) 



TOTAL OF ABOVE CALCULATIONS = 



Applicant claims small entity status. See 37 CFR 1.27. The fees indicated above 
'l are reduced by 1/2. 



SUBTOTAL 



ltt5cessing fee of $130.00 for furnishing the Enghsh translation later than Q 20 I I 30 
iMnths fi-om the earhest claimed priority date (37 CFR 1.492(f)). 



TOTAL NATIONAL FEE 



for recording the enclosed assignment (37 CFR 1.21(h)). The assignment must be 
^lompanied by an appropriate cover sheet (37 CFR 3.28, 3.31). $40.00 per property ■ 



TOTAL FEES ENCLOSED 



charged: 



A check in the amount of $ 59.00 



r the above fees is enclosed. 



n the amount of $ _ 



to cover the above fees. 



b. |~~] Please charge my Deposit Account No. 

A duplicate copy of this sheet is enclosed. 

c. |x| The Commissioner is hereby authorized to charge any additional fees which may be required, or credit any 

overpayment to Deposit Account No. 50-1089 A dupHcate copy of this sheet is enclosed. 

d. Q Fees are to be charged to a credit card. WARNING: Information on this form may become public. Credit card 

information should not be included on this form. Provide credit card information and authorization on PTO-2038. 



NOTE: Where an appropriate time limit under 37 CFR 1.494 or 1.495 has not been met, a^petition to revive (37 CFR 
1.137 (a) or (b)) must be filed and granted to restore the application to pef " ^ ^ 

SEND ALL CORRESPONDENCE TO: 

Janet E. Reed 
Saul Ewing 

1500 Market Street, 38th Floor 
Centre Square West 
Philadelphia, PA 19102 




-NATURE 

Janet E. Reed 



NAME 

36,252 



REGISTRATION NUMBER 



JKMtWdPGT/TTO 0 UN 200! 



09/?6333l 



IN THE UNITED STATES PATENT AND TRADEMARK OFFICE 



Application No. 09/763,331 

Intl. Filing Date: 8/24/99 

For: Synthetic Fatty Acid 
Desaturase Gene for 
Expression in Plants 



A Notice to Comply with the requirements of 37. C.F.R. 



§§ 1.821 - 1.825 was issued on April 26, 2001 in connection with 
the above-referenced patent application. A copy of the Notice is 
enclosed herewith. Applicants were given two months to respond to 
the Notice. 



To comply with the requirements of 3 7 C.F.R. §§1.821- 



1,825, submitted herewith is a computer- readable form of a 
sequence listing in connection with the above- identified patent 
application. The paper form of the sequence listing has already 
been incorporated into the application as filed. The undersigned 
attorney hereby verifies that the sequences disclosed in the 
paper form are the same as the corresponding sequences disclosed 
in the computer- readable form, and that no new matter has been 
added to the specification by the submiss ion of this sequence 
listing . 



In re the Application of 



Examiner : J . Anderson 



Charles E. Martin & Andrew Mitchell 



Group Art Unit : 



SUBMISSION OF SEQUENCE LISTING 



Respectfully submitted. 



SAUL EWING, L.L.P. 



Telephone: (215) 972-8386 
Facsimile: (215) 972-2292 



By 
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SAUL EWING LLP 09/763331 

Centre Square West 1500 Market Street 38*" Floor 

Philadelphia, PA 19102-2186 
Telephone (215) 972-7777 Facsimile (215) 972-7725 



COMMISSIONER OF PATENTS 
AND TRADEMARKS 
Washington DC 20231 

BOX PCT 


Date of Mailing 
February 22, 2001 


Applicant's or Agent's Reference 
Rutgers 13259-00008 


IDENTIFICATION OF THE INTERNATIONAL APPLICATION 


Int'l Appln. No. 
PCT/US99/19443 


Int'l Filing Date 
24 August 1999 


Applicant (name) 
MARTIN, Charles et al. 


tt^NSMITTAL OF LETTER TO THE UNITED STATES DESIGNATED/ELECTED 
5 OFFICE (DO/EO/US) CONCERNING A FILING UNDER 35 U.S.C. 371 



W 

!J CERTIFICATE OF MAILING 

r" BY EXPRESS MAIL UNDER 37 CFR §1. 10 



r#MBER OF EXPRESS MAIL MAILING LABEL ELSflSM4n3flUS 

DATE OF DEPOSIT WITH POSTAL SERVICE February 22, 2001 



I hereby certify that this paper or fee is being deposited with the United States Postal 
Service "EXPRESS MAIL POST OFFICE TO ADDRESSEE" service under 37 C.F.R. §1.10 on the date 
indicated above, and is addressed to the COMMISSIONER OF PATENTS AND TRADEMARKS, 
Washington, D.C. 20231 



Susan C. Sheehan 



Typed or Printed Name of Person 
Mailing Paper or Fee 



ture of Perso^'Mailing Paper or Fee 



581320.1 2/21/01 



Page 1 of 7 



PCT09 



9 
12 

C~> 14 



19 

2.1 
%^ 

% 

m3 



37 
iJ8 



RAW SEQUENCE LISTING 

PATENT APPLICATION: US/09/763,331 

Input Set : A:\PTO.txt 

Output Set: N:\CRF3\07112001\l763331.raw 

<110> APPLICANT: Martin, Charles E. 

Mitchell, Andrew 
<120> TITLE OF INVENTION: Synthetic Fatty Acid Desatur, 

Expression in Plants 
<130> FILE REFERENCE: 97-0081 PCT 
<140> CURRENT APPLICATION NUMBER: US/09/763,331 
<141> CURRENT FILING DATE: 1999-08-24 
<150> PRIOR APPLICATION NUMBER: US 60/097,586 
<151> PRIOR FILING DATE: 1998-08-24 
<16Q> NUMBER OF SEQ ID NOS : 8 

<17 0> SOFTWARE: FastSEQ for Windows Version 4.0 
<210> SEQ ID NO: 1 
<211> LENGTH: 1555 
<212> TYPE: DNA 

<213> ORGANISM: Saccharomyces cerevisiae 
<4 00> SEQUENCE: 1 

tacaacaaag atgccaactt ctggaactac tattgaattg attgacgacc 
ggatgactct gccagcagtg gcattgtcga cgaagtcgac ttaacggaag 
ggctactggt ttgaataaga aagcaccaag aattgtcaac ggttttggtt 
ctccaaggaa atggtttccg tggaattcga caagaaggga aacgaaaaga 
ggatcgtctg ctagaaaagg acaaccaaga aaaagaagaa gctaaaacta 
ctccgaacaa ccatggactt tgaataactg gcaccaacat ttgaactggt 
tcttgtttgt ggtatgccaa tgattggttg gtacttcgct ctctctggta 
gcatttaaac gttttccttt tctccgtttt ctactacgct gtcggtggtg 
tgccggttac catagattat ggtctcacag atcttactcc gctcactggc 
attctacgct atcttcggtt gtgcttccgt tgaagggtcc gctaaatggt 
tcacagaatt caccatcgtt acactgatac cttgagagat ccttatgacg 
tctatggtac tcccacatgg gatggatgct tttgaagcca aatccaaaat 
agctgatatt accgatatga ctgatgattg gaccattaga ttccaacaca 
cttgttgatg ttattaaccg ctttcgtcat tccaactctt atctgtggtt 
cgactatatg ggtggtttga tctatgccgg ttttattcgt gtctttgtca 
taccttttgc attaactcca tggctcatta catcggtacc caaccattcg 
aacccctcgt gacaactgga ttactgccat tgttactttc ggtgaaggtt 
ccaccacgaa ttcccaactg attacagaaa cgctattaag tggtaccaat 
taaggttatc atctatttga cttctttagt tggtctagca tacgacttga 
tcaaaatgct attgaagaag ccttgattca acaagaacaa aagaagatca 
ggctaagatt aactggggtc cagttttgac tgatttgcca atgtgggaca 
cttggctaag tctaaggaaa acaagggttt ggttatcatt tctggtattg 
atctggttat atctctgaac atccaggtgg tgaaacttta attaaaactg 
ggacgctacc aaggctttca gtggtggtgt ctaccgtcac tcaaatgccg 
cttggctgat atgagagtgg ctgttatcaa ggaaagtaag aactctgcta 
tagtaagaga ggtgaaatct acgaaactgg taagttcttt taagcatcac 
<210> SEQ ID NO: 2 
<211> LENGTH: 510 
<212> TYPE: PRT 

<213> ORGANISM: Saccharomyces cerevisiae 
<400> SEQUENCE: 2 



DATE: 07/11/2001 
TIME: 11:25:16 



ENTERED 



ase Gene for 



aatttccaaa 
ctaatatttt 
ctttaatggg 
agtccaattt 
aaattcacat 
tgaacatggt 
aagtaccttt 
tttctattac 
cattgagatt 
ggggccactc 
ctcgtagagg 
acaaggctag 
gacactacat 
actttttcaa 
ttcaacaagc 
atgacagaag 
accataactt 
acgacccaac 
agaaattctc 
ataaaaagaa 
aacaaacctt 
ttcacgacgt 
cattaggtaa 
ctcaaaatgt 
ttagaatggc 
attac 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1555 
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RAW SEQUENCE LISTING DATE: 07/11/2001 

PATENT APPLICATION: US/09/763,331 TIME: 11:25:16 

Input Set : A:\PTO.txt 

Output Set: N:\CRF3\07112001\l763331.raw 

60 Met Pro Thr Ser Gly Thr Thr He Glu Leu He Asp Asp Gin Phe Pro 



61 1 



15 



62 Lys Asp Asp Ser Ala Ser Ser Gly He Val Asp Glu Val Asp Leu Thr 

63 20 25 30 

64 Glu Ala Asn He Leu Ala Thr Gly Leu Asn Lys Lys Ala Pro Arg He 

65 35 40 45 

66 Val Asn Gly Phe Gly Ser Leu Met Gly Ser Lys Glu Met Val Ser Val 

67 50 55 60 

68 Glu Phe Asp Lys Lys Gly Asn Glu Lys Lys Ser Asn Leu Asp Arg Leu 

69 65 70 75 80 

70 Leu Glu Lys Asp Asn Gin Glu Lys Glu Glu Ala Lys Thr Lys He His 

71 85 90 95 

72 He Ser Glu Gin Pro Trp Thr Leu Asn Asn Trp His Gin His Leu Asn 

73 100 105 110 

74 Trp Leu Asn Met Val Leu Val Cys Gly Met Pro Met He Gly Trp Tyr 

75 115 120 125 

J.e Phe Ala Leu Ser Gly Lys Val Pro Leu His Leu Asn Val Phe Leu Phe 
"27 130 135 140 

;#8 Ser Val Phe Tyr Tyr Ala Val Gly Gly Val Ser He Thr Ala Gly Tyr 

J9 145 150 155 160 

pao His Arg Leu Trp Ser His Arg Ser Tyr Ser Ala His Trp Pro Leu Arg 

Wl 165 170 175 

i|2 Leu Phe Tyr Ala He Phe Gly Cys Ala Ser Val Glu Gly Ser Ala Lys 

iJ3 180 185 190 

■£|4 Trp Trp Gly His Ser His Arg He His His Arg Tyr Thr Asp Thr Leu 

,85 195 200 205 

;a6 Arg Asp Pro Tyr Asp Ala Arg Arg Gly Leu Trp Tyr Ser His Met Gly 

210 215 220 

f§8 Trp Met Leu Leu Lys Pro Asn Pro Lys Tyr Lys Ala Arg Ala Asp He 
m 225 230 235 240 

ISO Thr Asp Met Thr Asp Asp Trp Thr He Arg Phe Gin His Arg His Tyr 
Si 245 250 255 

|a2 He Leu Leu Met Leu Leu Thr Ala Phe Val He Pro Thr Leu He Cys 

93 260 265 270 

94 Gly Tyr Phe Phe Asn Asp Tyr Met Gly Gly Leu He Tyr Ala Gly Phe 

95 275 280 285 

96 He Arg Val Phe Val He Gin Gin Ala Thr Phe Cys He Asn Ser Leu 

97 290 295 300 

98 Ala His Tyr He Gly Thr Gin Pro Phe Asp Asp Arg Arg Thr Pro Arg 

99 305 310 315 320 

100 Asp Asn Trp He Thr Ala He Val Thr Phe Gly Glu Gly Tyr His Asn 

101 325 330 335 

102 Phe His His Glu Phe Pro Thr Asp Tyr Arg Asn Ala He Lys Trp Tyr 

103 340 345 350 

104 Gin Tyr Asp Pro Thr Lys Val He He Tyr Leu Thr Ser Leu Val Gly 

105 355 360 365 

106 Leu Ala Tyr Asp Leu Lys Lys Phe Ser Gin Asn Ala He Glu Glu Ala 

107 370 375 380 

108 Leu He Gin Gin Glu Gin Lys Lys He Asn Lys Lys Lys Ala Lys He 
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Gly 
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Ala 
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123 500 505 510 

126 <210> SEQ ID NO: 3 
^,427 <211> LENGTH: 1555 
^28 <212> TYPE: DNA 

:#29 <213> ORGANISM: Artificial Sequence 
M3I <220> FEATURE: 

il32 <223> OTHER INFORMATION: synthetic yeast delta-9 desaturase gene modified 
1033 for expression in plants 

1^35 <400> SEQUENCE: 3 

1^36 ggatccaaca atgcctactt ctggaactac tatcgagctt atcgatgatc aattccctaa 60 

^37 ggatgattct gcttcttctg gaatcgttga tgaggttgat cttactgagg ctaacatcct 120 

H38 tgctactgga cttaacaaga aggctcctag aatcgttaac ggattcggat ctcttatggg 180 

p^39 atctaaggag atggtttctg ttgagttcga taagaaggga aacgagaaga agtctaacct 240 

J24 0 tgatagactt cttgagaagg ataaccaaga gaaggaggag gctaagacta agatccatat 300 

!=*|41 ctctgagcaa ccttggactc tcaacaactg gcatcaacat ctcaactggc tcaacatggt 360 

^42 gctcgtctgt ggaatgccta tgatcggatg gtacttcgct ctctctggaa aagtgcctct 420 

Q43 ccatctcaac gttttcctct tctctgtctt ctactacgct gttggaggag tgtctatcac 480 

044 tgctggatac catagactct ggtctcatag atcttactct gctcattggc ctcttagact 540 

1=^45 cttctacgct atctttggat gtgcttctgt tgagggatct gctaagtggt ggggacattc 600 

14 6 tcatagaatc catcatagat acactgatac tcttagagat ccttacgatg ctagaagagg 660 

147 actttggtac tctcatatgg gatggatgct tcttaagcct aaccctaagt acaaggctag 720 

148 agctgatatc actgatatga ctgatgattg gactatcaga ttccaacata gacattacat 780 
14 9 cttgctcatg ctccttactg ctttcgtgat ccctactctc atctgtggat acttcttcaa 840 

150 cgattacatg ggaggactca tctacgctgg attcatcaga gtgttcgtca tccaacaagc 900 

151 tactttctgt atcaactcta tggctcatta catcggaact caacctttcg atgatagaag 960 

152 aactcctaga gataactgga tcactgctat cgttactttc ggagagggat accataactt 1020 

153 ccatcatgag ttccctactg attatagaaa cgctatcaag tggtaccaat acgatcctac 1080 

154 taaagtgatc atctacttga cttctctcgt gggacttgct tacgatctca agaagttctc 1140 

155 tcaaaacgct atcgaggagg ctcttatcca acaagagcaa aagaagatca acaagaagaa 1200 

156 ggctaagatt aattggggac ctgttcttac tgatcttcct atgtgggata agcaaacttt 1260 

157 ccttgctaag tctaaggaga acaagggact tgttatcatc tctggaatcg ttcatgatgt 1320 

158 ttctggatac atctctgagc atcctggagg agagacttta attaagactg ctcttggaaa 1380 

159 ggatgctact aaggctttct ctggaggagt ttacagacat tctaacgctg ctcaaaacgt 1440 

160 gcttgctgat atgagagttg ctgttatcaa ggagtctaag aactctgcta tcagaatggc 1500 

161 ttctaagaga ggagagatct acgagactgg aaagttcttc tgatctagag gatcc 1555 
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Ala 


Val 


Cys 


Phe 


Gly 


Leu 


Tyr 


198 


225 








230 










235 








240 


199 


Arg 


Tyr Ala 


Ala 


Ala 


Gin 


Gly Met 


Ala 


Ser 


Met 


He 


Cys 




Tyr 


Gly 


200 








245 










250 








255 


201 


Val 


Pro Leu 


Leu 


He 


Val 


Asn 


Ala 


Phe 


Leu 


Val 


Leu 


He 


Thr 


Tyr 




202 






260 










265 










270 




203 


Gin 


His Thr 


His 


Pro 


Ser 


Leu 


Pro 


His 


Tyr 


Asp 


Ser 


Ser 


Glu 


Trp 


Asp 


204 




275 










280 










285 




205 


Trp 


Leu Arg 


Gly Ala 


Leu 


Ala 


Thr 


Val 


Asp 


Arg 


Asp 


Tyr 


Gly 


He 


Leu 


206 




290 








295 










300 










207 


Asn 


Lys Val 


Phe 


His 


Asn 


He 


Thr 


Asp 


Thr 


His 


Val 


Ala 


His 


His 


Leu 


208 


305 








310 










315 










320 


209 


Phe 


Ser Thr 


Met 


Pro 


His 


Tyr 


Asn 


Ala 


Met 


Glu 


Ala 


Thr 


Lys 


Ala 


He 


210 








325 










330 








335 




211 


Lys 


Pro He 


Leu 


Gly 


Asp 


Tyr 


Tyr 


Gin 


Phe 


Asp 


Gly 


Thr 


Pro 


Trp 


Tyr 


212 






340 










345 










350 
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213 Val Ala Met Tyr Arg Glu Ala Lys Glu Cys He Tyr Val Glu Pro Asp 

214 355 360 365 

215 Arg Glu Gly Asp Lys Lys Gly Val Tyr Trp Tyr Asn Asn Lys Leu 

216 370 375 380 

219 <210> SEQ ID NO: 5 

220 <211> LENGTH: 1372 

221 <212> TYPE: DNA 

222 <213> ORGANISM: Arabidopsis thaliana 

224 <400> SEQUENCE: 5 

225 agagagagag attctgcgga 

226 atcgccccta cgtcagctcc 

227 ctacttcttc caagaaatcg 

228 ctttctcggt gggagatctg 

229 ctcgctcttt ctcctacctt 

230 ccaccaatta cttctctctc 

231 gggcctgtca aggctgtgtc 
^^32 acgcattcag cgactaccaa 
*j33 tcctcgtccc ttacttctcc 
;*g34 Gcctcgaaag agatgaagta 
~'i35 aatacctcaa caaccctctt 
0136 ggcccttgta cttagccttt 
i^|37 tcttccccaa cgctcccatc 
1=138 cgggtattct agccgtctgt 
|^|39 cgatgatctg cctctacgga 
£j4 0 cttacttgca gcacactcat 
"241 tcaggggagc tttggctacc 
''x£i2 acattacaga cacacacgtg 
'^43 tggaagctac aaaggcgata 
fi44 cgtggtatgt agcgatgtat 
^45 aaggtgacaa gaaaggtgtg 
Q4 6 attgtcgacc tttctcttgt 
P47 aatcttattg tccattttgt 
p49 <210> SEQ ID NO: 6 
"251 <400> SEQUENCE: 6 

W— > 252 000 

254 <210> SEQ ID NO: 7 

255 <211> LENGTH: 33 

256 <212> TYPE: PRT 

257 <213> ORGANISM: Arabidopsis thaliana 

259 <400> SEQUENCE: 7 

260 Trp Tyr Val Ala Met Tyr Arg Glu Ala Lys Glu Cys He Tyr Val Glu 

261 1 5 10 15 
2 62 Pro Asp Arg Glu Gly Asp Lys Lys Gly Val Tyr Trp Tyr Asn Asn Lys 

263 20 25 30 

264 Leu 

268 <210> SEQ ID NO: 8 

269 <211> LENGTH: 30 

270 <212> TYPE: PRT 

271 <213> ORGANISM: Saccharomyces cerevisiae 



ggagcttctt cttcgtaggg tgttcatcgt tattaacgtt 60 

atctccagaa acatgggtgc aggtggaaga atgccggttc 120 

gaaaccgaca ccacaaagcg tgtgccgtgc gagaaaccgc 180 

aagaaagcaa tcccgccgca ttgtttcaaa cgctcaatcc 240 

atcagtgaca tcattatagc ctcatgcttc tactacgtcg 300 

ctccctcagc ctctctctta cttggcttgg ccactctatt 360 

ctaactggta tctgggtcat agcccacgaa tgcggtcacc 420 

tggctggatg acacagttgg tcttatcttc cattccttcc 480 

tggaagtata gtcatcgccg tcaccattcc aacactggat 540 

tttgtcccaa agcagaaatc agcaatcaag tggtacggga 600 

ggacgcatca tgatgttaac cgtccagttt gtcctcgggt 660 

aacgtctctg gcagaccgta tgacgggttc gcttgccatt 720 

tacaatgacc gagaacgcct ccagatatac ctctctgatg 780 

tttggtcttt accgttacgc tgctgcacaa gggatggcct 840 

gtaccgcttc tgatagtgaa tgcgttcctc gtcttgatca 900 

Gcctcgttgc ctcactacga ttcatcagag tgggactggc 960 

gtagacagag actacggaat cttgaacaag gtgttccaca 1020 

gctcatcacc tgttctcgac aatgccgcat tataacgcaa 1080 

aagccaattc tgggagacta ttaccagttc gatggaacac 1140 

agggaggcaa aggagtgtat ctatgtagaa ccggacaggg 1200 

tactggtaca acaataagtt atgagcatga tggtgaagaa 1260 

ctgtttgtct tttgttaaag aagctatgct tcgttttaat 1320 

tgtgttatga cattttggct gctcattatg tt 1372 
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SEQUENCE LISTING 



<110> Martin, Charles E. 
Mitchell, Andrew 



<120> Synthetic Fatty Acid Desaturase Gene 
Expression in Plants 



<130> 97-0081 PCT 

<150> US 60/097,586 
<151> 1998-08-24 

<160> 7 

<170> FastSEQ for windows Version 3.0 

<210> 1 
<211> 1555 
<212> DNA 

<213> Saccharomyces cerevisiae 
<400> 1 

tacaacaaag atgccaactt ctggaactac tattgaattg attgacgacc aatttccaaa 60 

ggatgactct gccagcagtg gcattgtcga cgaagtcgac ttaacggaag ctaatatttt 120 

ggctactggt ttgaataaga aagcaccaag aattgtcaac ggttttggtt ctttaatggg 180 

ctccaaggaa atggtttccg tggaattcga caagaaggga aacgaaaaga agtccaattt 240 

ggatcgtctg ctagaaaagg acaaccaaga aaaagaagaa gctaaaacta aaattcacat 300 

ctccgaacaa ccatggactt tgaataactg gcaccaacat ttgaactggt tgaacatggt 360 

tcttgtttgt ggtatgccaa tgattggttg gtacttcgct ctctctggta aagtaccttt 420 

gcatttaaac gttttccttt tctccgtttt ctactacgct gtcggtggtg tttctattac 480 

tgccggttac catagattat ggtctcacag atcttactcc gctcactggc cattgagatt 540 

attctacgct atcttcggtt gtgcttccgt tgaagggtcc gctaaatggt ggggccactc 600 

tcacagaatt caccatcgtt acactgatac cttgagagat ccttatgacg ctcgtagagg 560 

tctatggtac tcccacatgg gatggatgct tttgaagcca aatccaaaat acaaggctag 720 

agctgatatt accgatatga ctgatgattg gaccattaga ttccaacaca gacactacat 7 80 

cttgttgatg ttattaaccg ctttcgtcat tccaactctt atctgtggtt actttttcaa 840 

cgactatatg ggtggtttga tctatgccgg ttttattcgt gtctttgtca ttcaacaagc 900 

taccttttgc attaactcca tggctcatta catcggtacc caaccattcg atgacagaag 960 

aacccctcgt gacaactgga ttactgccat tgttactttc ggtgaaggtt accataactt 1020 

ccaccacgaa ttcccaactg attacagaaa cgctattaag tggtaccaat acgacccaac 1080 

taaggttatc atctatttga cttctttagt tggtctagca tacgacttga agaaattctc 1140 

tcaaaatgct attgaagaag ccttgattca acaagaacaa aagaagatca ataaaaagaa 12 00 

ggctaagatt aactggggtc cagttttgac tgatttgcca atgtgggaca aacaaacctt 1260 

cttggctaag tctaaggaaa acaagggttt ggttatcatt tctggtattg ttcacgacgt 132 0 

atctggttat atctctgaac atccaggtgg tgaaacttta attaaaactg cattaggtaa 1380 

ggacgctacc aaggctttca gtggtggtgt ctaccgtcac tcaaatgccg ctcaaaatgt 1440 

cttggctgat atgagagtgg ctgttatcaa ggaaagtaag aactctgcta ttagaatggc 1500 

tagtaagaga ggtgaaatct acgaaactgg taagttcttt taagcatcac attac 1555 

<210> 2 
<211> 510 
<212> PRT 

<213> Saccharomyces cerevisiae 



<400> 2 
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Met Pro Thr Ser Gly Thr Thr He Glxi Leu He Asp Asp Gin Phe Pro 

15 10 15 

Lys Asp Asp Ser Ala Ser Ser Gly He Val Asp Glu Val Asp Leu Thr 

20 25 30 

Glu Ala Asn He Leu Ala Thr Gly Leu Asn Lys Lys Ala Pro Arg He 

35 40 45 

Val Asn Gly Phe Gly Ser Leu Met Gly Ser Lys Glu Met Val Ser Val 

50 55 60 

Glu Phe Asp Lys Lys Gly Asn Glu Lys Lys Ser Asn Leu Asp Arg Leu 
65 70 75 80 

Leu Glu Lys Asp Asn Gin Glu Lys Glu Glu Ala Lys Thr Lys He His 

85 90 95 

He Ser Glu Gin Pro Trp Thr Leu Asn Asn Trp His Gin His Leu Asn 

100 105 110 

Trp Leu Asn Met Val Leu Val Cys Gly Met Pro Met He Gly Trp Tyr 

115 120 125 

Phe Ala Leu Ser Gly Lys Val Pro Leu His Leu Asn Val Phe Leu Phe 

130 135 140 

Ser Val Phe Tyr Tyr Ala Val Gly Gly Val Ser He Thr Ala Gly Tyr 
145 150 155 160 

His Arg Leu Trp Ser His Arg Ser Tyr Ser Ala His Trp Pro Leu Arg 

165 170 175 

Leu Phe Tyr Ala He Phe Gly Cys Ala Ser Val Glu Gly Ser Ala Lys 

ISO 185 190 

Trp Trp Gly His Ser His Arg He His His Arg Tyr Thr Asp Thr Leu 

195 200 205 

Arg Asp Pro Tyr Asp Ala Arg Arg Gly Leu Trp Tyr Ser His Met Gly 

210 215 220 

Trp Met Leu Leu Lys Pro Asn Pro Lys Tyr Lys Ala Arg Ala Asp He 
225 230 235 240 

Thr Asp Met Thr Asp Asp Trp Thr He Arg Phe Gin His Arg His Tyr 

245 250 255 

He Leu Leu Met Leu Leu Thr Ala Phe Val He Pro Thr Leu He Cys 

260 265 270 

Gly Tyr Phe Phe Asn Asp Tyr Met Gly Gly Leu He Tyr Ala Gly Phe 

275 280 285 

He Arg Val Phe Val He Gin Gin Ala Thr Phe Cys He Asn Ser Leu 

290 295 300 

Ala His Tyr He Gly Thr Gin Pro Phe Asp Asp Arg Arg Thr Pro Arg 
305 310 315 320 

Asp Asn Trp He Thr Ala He Val Thr Phe Gly Glu Gly Tyr His Asn 

325 330 335 

Phe His His Glu Phe Pro Thr Asp Tyr Arg Asn Ala He Lys Trp Tyr 

340 345 350 

Gin Tyr Asp Pro Thr Lys Val He He Tyr Leu Thr Ser Leu Val Gly 

355 360 365 

Leu Ala Tyr Asp Leu Lys Lys Phe Ser Gin Asn Ala He Glu Glu Ala 

370 375 380 

Leu He Gin Gin Glu Gin Lys Lys He Asn Lys Lys Lys Ala Lys He 
385 390 395 400 

Asn Trp Gly Pro Val Leu Thr Asp Leu Pro Met Trp Asp Lys Gin Thr 

405 410 415 

Phe Leu Ala Lys Ser Lys Glu Asn Lys Gly Leu Val He He Ser Gly 

420 425 430 

He Val His Asp Val Ser Gly Tyr He Ser Glu His Pro Gly Gly Glu 

435 440 445 

Thr Leu He Lys Thr Ala Leu Gly Lys Asp Ala Thr Lys Ala Phe Ser 

450 455 460 

Gly Gly Val Tyr Arg His Ser Asn Ala Ala Gin Asn Val Leu Ala Asp 
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465 470 475 480 

Met Arg Val Ala Val He Lys Glu Ser Lys Asn Ser Ala He Arg Met 

485 490 495 

Ala Ser Lys Arg Gly Glu He Tyr Glu Thr Gly Lys Phe Phe 
500 505 510 

<210> 3 
<211> 1555 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> synthetic yeast delta- 9 desaturase gene modified 
for expression in plants 

<400> 3 

ggatccaaca atgcctactt ctggaactac tatcgagctt atcgatgatc aattccctaa 60 

ggatgattct gcttcttctg gaatcgttga tgaggttgat cttactgagg ctaacatcct 12 0 

tgctactgga cttaacaaga aggctcctag aatcgttaac ggattcggat ctcttatggg 180 

atctaaggag atggtttctg ttgagttcga taagaaggga aacgagaaga agtctaacct 240 

tgatagactt cttgagaagg ataaccaaga gaaggaggag gctaagacta agatccatat 300 

ctctgagcaa ccttggactc tcaacaactg gcatcaacat ctcaactggc tcaacatggt 360 

gctcgtctgt ggaatgccta tgatcggatg gtacttcgct ctctctggaa aagtgcctct 420 

ccatctcaac gttttcctct tctctgtctt ctactacgct gttggaggag tgtctatcac 4 80 

tgctggatac catagactct ggtctcatag atcttactct gctcattggc ctcttagact 540 

cttctacgct atctttggat gtgcttctgt tgagggatct gctaagtggt ggggacattc 600 

tcatagaatc catcatagat acactgatac tcttagagat ccttacgatg ctagaagagg 660 

actttggtac tctcatatgg gatggatgct tcttaagcct aaccctaagt acaaggctag 720 

agctgatatc actgatatga ctgatgattg gactatcaga ttccaacata gacattacat 780 

cttgctcatg ctccttactg ctttcgtgat ccctactctc atctgtggat acttcttcaa 840 

cgattacatg ggaggactca tctacgctgg attcatcaga gtgttcgtca tccaacaagc 900 

tactttctgt atcaactcta tggctcatta catcggaact caacctttcg atgatagaag 960 

aactcctaga gataactgga tcactgctat cgttactttc ggagagggat accataactt 1020 

ccatcatgag ttccctactg attatagaaa cgctatcaag tggtaccaat acgatcctac 10 30 

taaagtgatc atctacttga cttctctcgt gggacttgct tacgatctca agaagttctc 1140 

tcaaaacgct atcgaggagg ctcttatcca acaagagcaa aagaagatca acaagaagaa 1200 

ggctaagatt aattggggac ctgttcttac tgatcttcct atgtgggata agcaaacttt 1260 

ccttgctaag tctaaggaga acaagggact tgttatcatc tctggaatcg ttcatgatgt 1320 

ttctggatac atctctgagc atcctggagg agagacttta attaagactg ctcttggaaa 1380 

ggatgctact aaggctttct ctggaggagt ttacagacat tctaacgctg ctcaaaacgt 1440 

gcttgctgat atgagagttg ctgttatcaa ggagtctaag aactctgcta tcagaatggc 1500 

ttctaagaga ggagagatct acgagactgg aaagttcttc tgatctagag gatcc 1555 





<210> 


4 


























<211> 


383 


























<212> 


PRT 


























<213> 


Arabidopsis 


thaliana 


















<400> 


4 
























Met 


Gly Ala 


Gly Gly 


Arg 


Met 


Pro 


Val 


Pro 


Thr 


Ser 


Ser 


Lys 


Lys 


Ser 


1 




5 










10 










15 




Glu 


Thr Asp 


Thr Thr 


Lys 


Arg 


Val 


Pro 


Cys 


Glu 


Lys 


Pro 


Pro 


Phe 


Ser 






20 








25 










30 






Val 


Gly Asp 


Leu Lys 


Lys 


Ala 


He 


Pro 


Pro 


His 


Cys 


Phe 


Lys 


Arg 


Ser 




35 








40 










45 








He 


Pro Arg 


Ser Phe 


Ser 


Tyr 


Leu 


He 


Ser 


Asp 


He 


He 


He 


Ala 


Ser 




50 






55 










60 










Cys 


Phe Tyr 


Tyr Val 


Ala 


Thr 


Asn 


Tyr 


Phe 


Ser 


Leu 


Leu 


Pro 


Gin 


Pro 


65 






70 










75 










80 
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Leu 


Ser 


Tyr 


Leu 


Ala 


Trp 


Pro 


Leu 


Tyr 


Trp 


Ala 


Cys 


Gin 


Gly 


Cys 
95 


Val 


Leu 


Thr 


Gly 


He 
100 


Trp 


Val 


lie 


Ala 


His 


Glu 


Cys 


Gly 


His 


His 
110 


Ala 


Phe. 


er 


sp 


Tyr Gin 


Trp 


Leu 


Asp 


Asp 


Thr 


Val 


Gly Leu 


He 


Phe 


His 


Ser 






115 










12 0 










125 










Leu 


Leu 


Val 


Pro 


Tyr 


Phe 
135 


Ser 


Trp 


Lys 


Tyr 


Ser 
140 


His 


Arg 


Arg 


His 




Ser 


Asn 


Thr 


Gly 


Ser 


Leu 


Glu 


Arg 


Asp 


Glu 


Val 


Phe 


Val 


Pro 


Lys 












150 










155 










160 




ys 


Ser 


Ala 


He 
165 


Lys 


Trp 


Tyr 


Gly 


Lys 
17 0 


Tyr 


Leu 


Asn 


Asn 


Pro 
175 


Leu 


Gly 


Arg 


lie 


Met 
180 


Met 


Leu 


Thr 


Val 


Gin 
185 


Phe 


Val 


Leu 


Gly 


Trp 
190 


Pro 


Leu 


Tyr 


Leu 


Ala 


Phe 


Asn 


Val 


Ser 


Gly Arg 


Pro 


Tyr Asp 


Gly 


Phe 


Ala 


Cys 






195 










200 










205 








His 


Phe 


Phe 


Pro 


Asn 


Ala 


Pro 


He 


Tyr Asn 


Asp 


Arg 


Glu 


Arg 


Leu 


Gin 




210 










215 










220 










lie 


Tyr 


Leu 


Ser 


Asp 


Ala 


Gly 


He 


Leu 


Ala 


Val 


Cys 


Phe 


Gly 


Leu Tyr 


225 










230 










235 










240 


Arg 


Tyr 


Ala 


Ala 


Ala 


Gin 


Gly Met 


Ala 


Ser 


Met 


He 


Cys 


Leu 


Tyr 


Gly 










245 










250 










255 




Val 


Pro 


Leu 


Leu 
260 


He 


Val 


Asn 


Ala 


Phe 
265 


Leu 


Val 


Leu 


He 


Thr 
270 


Tyr 


Leu 


Gin 


His 


Thr 
275 


His 


Pro 


Ser 


Leu 


Pro 
280 


His 


Tyr 


Asp 


Ser 


Ser 
285 


Glu 


Trp 


Asp 


Trp 


290 


Arg 


Gly 


Ala 


Leu 


Ala 
295 


Thr 


Val 


Asp 


Arg 


Asp 
300 


Tyr 


Gly 


He 


Leu 


Asn 


Lys 


Val 


Phe 


His 


Asn 


He 


Thr 


Asp 


Thr 


His 


Val 


Ala 


His 


His 


Leu 


305 










310 










315 










320 


Phe 


Ser 


Thr 


Met 


Pro 


His 


Tyr Asn Ala Met 


Glu 


Ala 


Thr 


Lys 


Ala 


He 










325 










330 










335 




Lys 


Pro 


lie 


Leu 


Gly 


Asp 


Tyr 


Tyr 


Gin 


Phe 


Asp 


Gly Thr 


Pro 


Trp 


Tyr 








340 










345 










350 






Val 


Ala 


Met 


Tyr 


Arg 


Glu 


Ala 


Lys 


Glu 


Cys 


He 


Tyr Val 


Glu 


Pro Asp 






355 










360 










365 








Arg 


Glu 


Gly Asp 


Lys 


Lys 


Gly Val 


Tyr 


Trp 


Tyr 


Asn 


Asn 


Lys 


Leu 






370 










375 










380 











<210> 5 
<211> 1372 
<212> DNA 

<213> Arabidopsis thaliana 
<400> 5 

agagagagag attctgcgga ggagcttctt cttcgtaggg tgttcatcgt tattaacgtt 60 

atcgccccta cgtcagctcc atctccagaa acatgggtgc aggtggaaga atgccggttc 120 

ctacttcttc caagaaatcg gaaaccgaca ccacaaagcg tgtgccgtgc gagaaaccgc 180 

ctttctcggt gggagatctg aagaaagcaa tcccgccgca ttgtttcaaa cgctcaatcc 24 0 

ctcgctcttt ctcctacctt atcagtgaca tcattatagc ctcatgcttc tactacgtcg 300 

ccaccaatta cttctctctc ctccctcagc ctctctctta cttggcttgg ccactctatt 360 

gggcctgtca aggctgtgtc ctaactggta tctgggtcat agcccacgaa tgcggtcacc 42 0 

acgcattcag cgactaccaa tggctggatg acacagttgg tcttatcttc cattccttcc 48 0 

tcctcgtccc ttacttctcc tggaagtata gtcatcgccg tcaccattcc aacactggat 54 0 

ccctcgaaag agatgaagta tttgtcccaa agcagaaatc agcaatcaag tggtacggga 60 0 

aatacctcaa caaccctctt ggacgcatca tgatgttaac cgtccagttt gtcctcgggt 660 

ggcccttgta cttagccttt aacgtctctg gcagaccgta tgacgggttc gcttgccatt 72 0 

tcttccccaa cgctcccatc tacaatgacc gagaacgcct ccagatatac ctctctgatg 780 

cgggtattct agccgtctgt tttggtcttt accgttacgc tgctgcacaa gggatggcct 840 
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cgatgatctg cctctacgga gtaccgcttc tgatagtgaa tgcgttcctc gtcttgatca 90 0 

cttacttgca gcacactcat ccctcgttgc ctcactacga ttcatcagag tgggactggc 960 

tcaggggagc tttggctacc gtagacagag actacggaat cttgaacaag gtgttccaca 1020 

acattacaga cacacacgtg gctcatcacc tgttctcgac aatgccgcat tataacgcaa 1080 

tggaagctac aaaggcgata aagccaattc tgggagacta ttaccagttc gatggaacac 1140 

cgtggtatgt agcgatgtat agggaggcaa aggagtgtat ctatgtagaa ccggacaggg 1200 

aaggtgacaa gaaaggtgtg tactggtaca acaataagtt atgagcatga tggtgaagaa 1260 

attgtcgacc tttctcttgt ctgtttgtct tttgttaaag aagctatgct tcgttttaat 132 0 

aatcttattg tccattttgt tgtgttatga cattttggct gctcattatg tt 1372 



<210> 6 
<211> 33 
<212> PRT 

<213> Arabidopsis thaliana 



<400> 7 

Trp Tyr Val Ala Met Tyr Arg Glu 

1 5 
Pro Asp Arg Glu Gly Asp Lys Lys 
20 

Leu 



Ala Lys Glu Cys lie Tyr Val Glu 

10 15 
Gly Val Tyr Trp Tyr Asn Asn Lys 
25 30 



<210> 7 
<211> 30 
<212> PRT 

<213> Saccharomyces cerevisiae 
<400> 7 

Met Arg Val Ala Val lie Lys Glu Ser Lys Asn Ser Ala lie Arg Met 

15 10 15 

Ala Ser Lys Arg Gly Glu He Tyr Glu Thr Gly Lys Phe Phe 
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SYNTHETI C FATTY ACID DESATURASE GENE 
FOR EXPRESS ION IM LJPLANTS 

This application claims priority to U.S. 
Provisional Application No. 60/097,586, filed August 24, 
1998, the entirety of which is incorporated by reference 
herein. 

5 

FIELD OF THE INVENTION 

This invention relates to the field of genetic 
engineering, and more particularly to transformation of 
plants with heterologous fatty acid desaturase genes 
10 modified for optimum expression in plants. 

BACKGROHND OF THE INVENTION 

Several publications are referenced in this 
application in order to more fully describe the state of 
15 the art to which this invention pertains. The disclosure 
of each of these publications is incorporated by 
reference herein. 

Alteration of fatty acid desaturation in plants 
is of interest to plant biologists and food scientists 

2 0 alike, due to the influence of unsaturated fatty acids on 

the health benefits and flavors of foods, as well as the 
role of these molecules in plant biological processes. 
For a nation interested in healthy diet, the quality of 
fats and oils depends on their fatty acid composition, 
25 with oils high in monounsaturated fatty acids (e.g., 

canola, olive) gaining popularity as new health benefits 
are discovered. Considering the flavors of plant foods, 
many flavor-producing compounds are derived from 
peroxidation of unsaturated fatty acids. Thus, efforts 

3 0 are being made to produce plants with increased amounts 
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of unsaturated fatty acids, preferably monounsaturated 
fatty acids. 

In animal and fungal cells, monounsaturated 
fatty acids are aerobically synthesized from saturated 
5 fatty acids by a microsomal A-9 fatty acid desaturase 
that is membrane bound and cytochrome bg-dependent . A 
double bond is inserted between the 9- and 10-carbons of 
palmitoyl (16:0) and stearoyl (18:0) CoA to form 
palmitoleic (16:1) and oleic (18:1) acids. In the 

10 reaction mechanism, electrons are transferred from NADH- 

dependent cytochrome bg reductase, via the heme- containing 
cytochrome (Cyt bg) molecule, to the A-9 fatty acid 
desaturase. The major form of cytochrome in animal, 
fungal and plant cells exists as an independent protein 

15 molecule that is anchored to the membrane by a short, 
carboxyl terminal, hydrophobic stretch of amino acids. 
The carboxyl terminal anchor orients the heme group of 
the Cyt bg on the membrane surface and allows it to 
trans 1 at ionally diffuse across the surface of the 

2 0 membrane. This property of lateral mobility allows this 
form of cytochrome bg to participate as an electron donor 
to a number of different proteins that catalyze a variety 
metabolic reactions on the membrane surface, including 
fatty acid desaturases, various sterol biosynthetic 

25 enzymes and a variety of cytochrome P450 mediated 

reactions . While this contributes to the versatility of 
Cyt bj as an electron donor, it also implies that the 
major form of cytochrome shuttles between its redox 
partners by translational diffusion across the surface of 

30 the membrane (Strittmatter and Rogers, Proc . Natl. Acad. 
Sci. USA, 72: 2658-2661, (1975; Lederer, Biochimie 76: 
674-692, 1994). Furthermore, this mechanism suggests 
that an independent, membrane bound cytochrome molecule 



wo 00/11012 



PCT/US99/19443 



can potentially limit the rate of the metabolic reaction, 
depending on its abundance, its location on the membrane 
surface, its proximity to the electron acceptor, and the 
rate at which it can move and orient itself to the 
5 acceptor on the membrane surface. 

In plants, unsaturated fatty acids are formed 
and incorporated into complex lipids in two distinct 
cellular compartments, De novo fatty acid synthesis 
occurs almost exclusively in the plastids, producing the 

10 saturated species 16:0-ACP (acyl carrier protein) and 
18:0-ACP. 18:1-ACP is formed from 18:0-ACP in the 
plastid by a soluble, ferredoxin- dependent A- 9 
desaturase. These fatty acids are then shunted into one 
of two routes - a plast id-localized "procaryotic" pathway 

15 or a cytosolic/ER (endoplasmic reticulum) "eucaryotic" 
pathway - for further modification and acylation into 
glycerolipids (Soraerville and Browse, Science 252 : 80-87, 
1991) . The acyl ACPs that are shunted into the 
prokaryotic pathway remain within the plastid and are 

2 0 used for the synthesis of phosphatidic acid and further 

conversion to chloroplast glycerolipids. The fatty acyl 
groups of those lipids may be further desaturated by 
plastid desaturases that also use ferrodoxin as the 
electron donor. 

25 Acyl -ACPs that are shunted into the eukaryotic 

pathway are converted to free fatty acids, transported 
across the chloroplast membrane into the cytoplasm where 
they are converted to acyl CoA thioesters by acyl CoA 
synthetase. Those fatty acids are then converted to 

3 0 cytoplasmic/ER phosphatidic acid which can then be 

converted to membrane glycerophospholipids , or storage 
lipids, in the form of triacylglycerols and sterol esters 
that are the major components of plant oils. 
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Most polyunsaturated 18 -carbon plant fatty- 
acids appear to be formed in the cytosol by the ER-bound 
desaturases (Table 1). Once the 18:1 fatty acid is 
incorporated into phospholipid, an ER-bound desaturase 
5 can catalyze the formation of a A- 12 double bond in the 
fatty acyl chain to form A-9,12 18:2. Other ER bound 
desaturase enzymes can act on 18:2 to introduce a A-15 
double bond to form A9,12,15 18:3. These desaturase are 
thought to be similar to animal and fungal desaturases 
10 because they are membrane bound and appear to require a 
cytochrome bj-mediated electron transport chain. 



TABLE 1: 



20 



25 



Plant 


Gene 


Desaturase 
Type 


Primary 
Activity 


b5 
chimera 


Reference 


Arabidopsis 


FAD2 


A12, 

microsomal 


18:I->18:2 


no 


OkuleyJ. et al. 
Plant Cell 6: 147- 
158, 1994 


Arabidopsis 


FADS 


A15, 

microsomal 


18:2->18:3 


no 


Shah S. & Z. Xin, 
Plant Physiol. 114: 
1533-1539, 1997 


Nicotiana 
tabacum 


NtFA 
D3 


A15, 

microsomal 


18:2->18:3 


no 


HamadaT. etal. 
Plant & Cell. 
Physiol. 37: 606- 
611, 1996, 
Hamada T. et al. 
Transgenic Res. 5: 
115-121, 1996 


Soybean 


FAD 
2-1 


A12, 

microsomal, 

developing 

seeds 


18:l-> 
18:2 


no 


Heppard E.P. et al. 
Plant Physiol. 110: 
311-319, 1996 


Soybean 


FAD 
2-2 


A12, 

microsomal 
developing 
seeds and 
vegetative 
tissues 


18:1->18;2 


no 


Heppard, E.P. et al. 
1996, supra 
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Borage 




A-6 


18:2 


yes, N- 


Sayanova et al. 








(9,12)-18:3 


terminal 


Proc. Natl. Acad. 








(6,9,12) 




Sci. USA 94:4211- 












4216, 1997 



The conversion of saturated fatty acyl chains 
5 to monounsaturated species in plants appears to be 

confined to the chloroplasts . No A-9 desaturase activity 
has been identified in the cytoplasm or endoplasmic 
reticulum of plants. The soluble plant chloroplast A-9 
desaturase is highly specific for 18:0-ACP as a substrate 

10 and does not desaturate 16:0-ACP (Somerville and Browse, 
supra). As a result, only a small amount of 16:1 is 
present in most higher plants, while the pool of 16:0 is 
concomitantly larger due to its disfavor as a substrate 
for the plant desaturase. By comparison, a larger amount 

15 of 18:1 is found in higher plant cells, with a 

correspondingly lesser amount of 18:0. Thus, for the 
purpose of increasing the concentration of mono- 
unsaturated lipids in a plant, the 16:0 fatty acid 
constitutes a significant pool of available substrate 

2 0 that is under-utilized by the endogenous plant 
desaturase . 

In contrast to the plant A-9 desaturase, fungal 
and animal A-9 desaturases efficiently convert a wide 
range of saturated fatty acids with differing hydrocarbon 

2 5 chain lengths to monounsaturated fatty acids. The 

Saccharomyces cerevisiae enyzme, for example, efficiently 
desaturates even and odd chain fatty acyl CoA substrates 
from 13 carbons to 19 carbons in length. A broad 
functional homology exists among various Cyt bj-dependent 

3 0 desaturases, as evidenced, for example, by the successful 

expression of the rat A-9 desaturase in yeast (Stukey et 
al., J. Biol. Chem. 265 : 20144-20149, 1990). 
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The rat and yeast A- 9 desaturase genes have 
been expressed in plants: both the rat and the yeast 
genes have been expressed in tobacco (Grayburn et al . , 
BioTechnology 10: 675-678, 1992 (rat); Polashock et al . , 
Plant Physiol. 100 : 894-901, 1992 (yeast), and the yeast 
gene has also been expressed in tomato (Wang et al . , J. 
Agric. Food Chem. 44; 3399-3402, 1996) . The yeast A-9 
desaturase has been shown to function in tobacco and 
tomato, leading to increases in the level of 
monounsaturated fatty acids (both 16:1 and 18:1) and 
other compounds derived from monounsaturated fatty acids 
(e.g., polyunsaturated fatty acids, hexanal, 1-hexanol, 
heptanal, t^rans- 2 -octenal) (Polashock et al . , supra; Wang 
et al; supra) . Expression of the rat desaturase also led 
to an increase in monounsaturated 16- and 18-carbon fatty 
acids (Grayburn et al . , supra) . 

From the foregoing, it can be seen that 
transgenic plants expressing animal or fungal A-9 
desaturase genes can be improved in their unsaturated 
fatty acid composition by virtue of the activity of the 
foreign enzyme. Of further advantage, it has recently 
been discovered that some fungal A-9 desaturases (e.g., 
Saccharomyces cerevisiae) are fusion proteins comprising 
an intrinsic Cyt bg domain (Mitchell &: Martin, J. Biol. 
Chem. 270: 29766-29772, 1995) . When this gene is 
expressed, sufficient Cyt bg is produced to drive the 
desaturase reaction at an optimum level and is not 
dependent on existing plant Cyt b^. The known animal A-9 
desaturases do not contain this fused Cyt bg motif and 
must rely on independently-produced Cyt bg to provide the 
electrons for the reactions. 

Though fungal or animal A-9 desaturases (e.g. 
the S. cerevisiae desaturase or the animal desaturases) 
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may be expressed and functional in certain plants, their 
expression is likely less than optimal in plants, and 
expression may not even be possible in other plant 
species, due to several factors, including differences in 
codon usage and codon preference in plants as compared to 
fungi, and among different plant species and the presence 
of cryptic intron splicing signals, among others. All of 
these factors can lead to poor expression, or no 
expression, of a non-plant foreign gene in a plant cell . 

Accordingly, in order to make use of non-plant 
fatty acid desaturases, particularly those such as the S. 
cerevisiae A- 9 desaturase comprising an internal Cyt bg 
motif, a need exists to design modified desaturase- 
encoding DNA molecules that are customized for expression 
in plant cells and specific plant tissues. It would be 
of even greater advantage to optimize such modified DNA 
molecules for expression in particular plant species, 
such as those that are grown and harvested primarily for 
oils . 

SUMMARY OF THE INVENTION 

According to one aspect of the invention, a 
synthetic fatty acid desaturase gene for expression in a 
multi-cellular plant is provided, the gene comprising a 
desaturase domain and a Cyt domain, wherein the gene is 
customized for expression in a plant cytoplasm. In one 
embodiment, the synthetic gene is customized for 
expression in a monocotyledonous plant . In another 
embodiment, the synthetic gene is customized for 
expression in a dicotyledonous plant. In a preferred 
embodiment, the synthetic gene is customized for 
expression in a plant genus selected from the group 
consisting of Arahidopsis , Brassica, Phaeseolus , Oryza, 
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Olea, Elaeis (Oil Palm) and Zea. 

In a preferred embodiment of the invention, the 
desaturase is a cytosolic A-9 desaturase . The 
Saccharomyces cerevisiae A-9 desaturase is particularly 
5 preferred. 

In another embodiment of the invention, the 
synthetic gene is customized from a naturally occurring 
gene comprising both a desaturase domain and a cyt 
domain. Alternatively, the synthetic gene is a chimeric 

10 gene comprising a desaturase domain and a heterologous 
cyt bs domain. 

In another embodiment, the synthetic gene is 
customized from a naturally occurring gene such that the 
synthetic gene and the naturally occurring gene encode an 

15 identical amino acid sequence. Alternatively, the 

synthetic gene is customized from a naturally occurring 
gene such that the synthetic gene and the naturally 
occurring gene encode a similar and functionally 
conserved amino acid sequence . 

20 In another embodiment, a naturally occurring or 

a synthetic gene is customized so that specific amino 
acid modification are made to enhance the function of the 
encoded protein. Examples of such modifications include 
changing amino acids that are subjected to 

25 phosphorylation or other post-translational modifications 
that may alter or regulate the activity of the A-9 
desaturase enzyme. 

In another embodiment of the invention, 
elements of a naturally occurring or a synthetic 

30 desaturase gene that are not essential for enzymatic 
function are replaced or linked with elements derived 
from plant ER lipid biosynthetic genes that are normally 
expressed in maturing seeds or other plant tissues. The 
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improved expression of the modified gene produced by the 
inclusion or substitution of plant DNA sequences in the 
synthetic gene will result from native plant signal or 
control elements in those sequences that affect 
desaturase gene expression at one or more levels. 

According to another aspect of the invention, a 
method is provided for constructing and customizing a 
bifunctional desaturase/cyt b5 encoding gene for 
expression in the cytosol of a multicellular plant. The 
method comprises (a) providing a DNA molecule comprising 
a desaturase-encoding moiety operably linked to a cyt bg- 
encoding moiety, said DNA molecule producing the 
bifunctional polypeptide in a non- customized form; (b) 
back-translating the polypeptide sequence using preferred 
codons for expression in a multicellular plant, thereby 
producing a back- translated nucleotide sequence; (c) 
analyzing the back- translated nucleotide sequence for 
features that could diminish or prevent expression in the 
plant cytoplasm, including, optionally (1) probable 
intron splice sites (characterized by T-rich regions) ; 
(2) plant polyadenylation signals (e.g., AATAAA) ; (3) 
polymerase II termination sequence (e.g., CAN^^.gjAGTNNAA, 
where N is any nucleotide) ; (4) hairpin consensus 
sequences (e.g., UCUUCGG) ; and (5) the sequence- 
destabilizing motif ATTTA; (d) modifying the analyzed 
sequence to correct or remove the features that could 
diminish or prevent expression in the plant cytoplasm; 
and, optionally, (e) introducing desirable cloning 
features, such as restriction sites, into the sequence in 
a manner that does not materially affect the desired 
codon usage or final polypeptide sequence. 

The method set forth above may be adapted by 
incorporating into the customized gene one or more 
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genomic segments from plant desaturase or other ER lipid 
biosynthetic genes, which are determined to further 
optimize gene expression in plants. This method 
comprises (1) identifying cDNA sequences that have 
potential to comprise such beneficial elements, (2) 
creating yeast vectors expressing desaturase genes 
modified to contain these elements, (3) testing the 
vectors in a yeast expression system, (4) isolating 
regions from genomic DNA that are homologous to the 
beneficial cDNA elements, and (6) using them to construct 
chimeric or hybrid synthetic genes that produce 
functional and highly efficient desaturase activities in 
plant tissues. 

Other features and advantages of the present 
invention will be better understood by reference to the 
drawings, detailed description and examples that follow. 

BRIEF DESCRIPTION OF THE DRAWINGS 

Figure 1. GCG Pileup comparison of stearoyi- 
CoA desaturase protein sequences . Sequences containing a 
Cyt bj domain are indicated with a +; sequences lacking a 
Cyt bj domain are indicated with a - ; sequences still in 
question are indicated with a ? . 

Fignure 2. GCG Pileup comparison of Cytochrome 
bg protein sequences. 

DETAILED DESCRIPTION OF THE INVENTION 
I. Definitions 

Various terms relating to the biological 
molecules of the present invention are used herein above 
and also throughout the specifications and claims. 

The term "promoter region" refers to the 5 ' 
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regulatory regions of a gene. 

The term "reporter gene" refers to genetic 
sequences which may be operably linked to a promoter 
region forming a transgene, such that expression of the 
reporter gene coding region is regulated by the promoter 
and expression of the transgene is readily assayed. 

The term "selectable marker gene" refers to a 
gene product that when expressed confers a selectable 
phenotype, such as antibiotic resistance, on a 
transformed cell or plant. 

The terra "operably linked" means that the 
regulatory sequences necessary for expression of the 
coding sequence are placed in the DNA molecule in the 
appropriate positions relative to the coding sequence so 
as to effect expression of the coding sequence. This 
same definition is sometimes applied to the arrangement 
of coding sequences and transcription control elements 
(e.g. promoters, enhancers, and termination elements) in 
an expression vector. 

The term "DNA construct" refers to genetic 
sequence used to transform plants and generate progeny 
transgenic plants . These constructs may be administered 
to plants in a viral or plasmid vector. Other methods of 
delivery such as Agrobacterium T-DNA mediated 
transformation and transformation using the biolistic 
process are also contemplated to be within the scope of 
the present invention. The transforming DNA may be 
prepared according to standard protocols such as those 
set forth in "Current Protocols in Molecular Biology", 
eds. Frederick M. Ausubel et al . , John Wiley & Sons, 
1999. 

This invention provides synthetic DNA molecules 
(sometimes referred to herein as "synthetic genes") that 
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encode a fatty acid desaturase useful for modifying the 
fatty acid composition of a plant. The DNA molecules 
describe in accordance with this invention are superior 
to DNA molecules currently available for this purpose, in 
two important respects: (1) they encode a dual -domain 
polypeptide (sometimes referred to herein as a 
"bifunctional polypeptide or protein"), one domain being 
the fatty acid desaturase, and the other domain being 
cytochrome h^, a protein required to support the electron 
transfer events that enable the desaturase to function; 
and (2) they are customized for expression in the cytosol 
of plant cells, and further customized for expression in 
particular selected plant species. 

Design of synthetic genes of the present 
invention is accomplished in two broad steps. First, the 
two components (the desaturase-encoding component and the 
Cyt bg-encoding component) are selected and linked 
together, if they do not occur together naturally. 
Second, the DNA molecule is optimized for expression in 
the cytosol of a plant cell, or further for expression in 
a particular plant species, or group of species. 

With regard to the first step, it should be 
noted that several fungal, animal and plant species, 
including yeast, are now known to contain naturally- 
occurring genes encoding dual -domain cytoplasmic fatty 
acid desaturases. As mentioned above, the yeast and rat 
A- 9 desaturase genes have been expressed and shown to 
function in plants. However, prior to the present 
invention, it was not appreciated that the bifunctional 
yeast desaturase offers a significant advantage over the 
single- function animal desaturase in plant ceils, where 
the requisite Cyt bg is available only in small amounts, 
and the yeast protein can provide its own supply of Cyt 
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bs. 

With regard to the second step - optimization 
for expression in the plant cytosol - it was discovered 
in accordance with the present invention that a non-plant 
desaturase- encoding gene, such as the yeast OLE 1, though 
expressed in some plants, may not be optimally expressed 
in those plants. Furthermore, the inventors have found 
that the yeast gene is poorly expressed in other plant 
species, thus highlighting the advantages obtainable by 
optimizing such a gene for expression in a plant cell. 

Sections II-IV below describe in detail how to 
design and use the synthetic genes of the present 
invention. To the extent that specific materials are 
mentioned, it is merely for purposes of illustration and 
is not intended to limit the invention. Unless otherwise 
specified, general biochemical and molecular biological 
procedures, such as those set forth in Sambrook et al . , 
Molecul ar Cloning . Cold Spring Harbor Laboratory (1989) 
(hereinafter "Sambrook et al . " ) or Ausubei et al . (eds) 
Current Protocols in Molecular Biology . John Wiley & Sons 
(1999) (hereinafter "Ausubei et al . " ) are used. 

II- Design and construction of the SYnthetic DNA molecules 
A. Selection of component DNA segments 

This invention contemplates the use of the 
following source DNAs, which are thereafter modified for 
expression in plants, if necessary: 

1. naturally occurring genes or cDNAs that 
encode dual domain polypeptides comprising a desaturase 
domain and a Cyt bg domain; 

2 . chimeric genes in which a desaturase- 
encoding sequence from one source (e.g., the desaturase 
domain of a dual domain fungal A- 9 desaturase, or the 



wo 00/11012 



PCT/US99/19443 



-14- 

single domain rat desaturase) , is linked to a Cyt bg- 
encoding sequence from a different source (e.g., a 
plant) ; 

3 . chimeric genes in which a sequence that 
encodes a fragment of a naturally occurring plant Cyt bg 
(e.g. the heme binding fold, or residues that comprise 
the electron donor or acceptor sites, or residues that 
act as membrane targeting or retention signals, or 
residues that act to stabilize the protein in the plant 
cytoplasmic environment) is substituted for homologous 
regions in the cytochrome domain of a dual domain 
polypeptide such as the yeast A- 9 desaturase; and 

4. chimeric genes in which elements that encode 
the essential enzymatic domains from one source (e.g. a 
native or synthetic gene derived from a fungal A-9 
desaturase) are linked to elements derived from native 
plant desaturases that enhance transcription, raRNA 
processing, mRNA stability, protein folding and 
maturation, membrane targeting or retention, or protein 
stability. 

Naturally occurring genes or cDNAs that encode 
dual domain desaturase/Cyt b^ proteins have been 
identified in several fungal species, including 
Saccharomyces cerevisiae, Pichia augusta, Histoplasma 
capsulatum and Cryptococcus curvatus (See Fig. 1) . 
Naturally occurring genes or cDNA=s that encode 
independent, diffusible Cyt bj proteins have been 
identified in several plant species, including Nicotiana 
tabacum (tobacco) , Oryza sativa (rice) , Cuscuta reflexa 
(southern Asian dodder) , Arabidopsis thaliana, Brassica 
oleracea and Olea europaea (olive) . A N- terminal Cyt bg 
domain of a A- 6 desaturase has also been identified in 
the plant Borag-o officinalis, and in the Saccharomyces 
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cerevisiae FAHl gene that encodes a very long chain fatty 
acid hydroxylase. Genes or cDNAs from these species, as 
well as DNA from any other species identified in the 
future as encoding such a dual domain protein, are 
contemplated for use in the synthetic genes of the 
present invention. 

In a preferred embodiment , the yeast OLEl gene 
is used- This embodiment is described in detail in 
Example 1 . 

The second strategy involves linking a DNA 
segment encoding a fatty acid desaturase from one source 
with a Cyt bg domain from another source. In a preferred 
embodiment, this chimeric gene is fashioned after the 
naturally-occurring dual function genes discussed above. 
That is, the Cyt bj domain and the desaturase domain are 
situated in the same positions respective to each other 
as is found in the naturally occurring genes (see, e.g., 
Mitchell & Martin, J. Biol. Chem. 270 : 29766-29772, 
1996) . 

The chimeric dual -domain proteins of the 
invention are prepared by recombinant DNA methods, in 
which DNA sequences encoding each domain are operably 
linked together such that upon expression, a fusion 
protein having the desaturase and Cyt bg functions 
described above is produced. As defined above, the term 
"operably linked" means that the DNA segments encoding 
the fusion protein are assembled with respect to each 
other, and with respect to an expression vector in which 
they are inserted, in such a manner that a functional 
fusion protein is effectively expressed. The selection 
of appropriate promoters and other 5 • and 3 ' regulatory 
regions, as well as the assembly of DNA segments to form 
an open reading frame, employs standard methodology well 
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known to those skilled in the art. 

Thus, preparing the chimeric DNAs of the 
invention involves selecting DNA sequences encoding each 
of the aforementioned components and operably linking the 
5 respective sequences together in an appropriate vector. 
The sequences are thereafter expressed to produce the 
dual -function protein. 

Genes or cDNAs that encode single -function 
cytoplasmic A- 9 fatty acid desaturases have been 
10 identified in a diverse array of procaryotic and 

eucaryotic species, including insects, fungi and mammals, 
but not plants (Fig. 1) . Genes or cDNAs from any of 
these species, as well as DNA from any other species 
identified in the future as encoding a fatty acid 
15 desaturase, are contemplated for use in the synthetic 
genes of the present invention. 

In preferred embodiments, desaturase-encoding 
genes from eucaryotes , most preferably fungi or mammals, 
are used. In a particularly preferred embodiment, a DNA 

2 0 encoding the rat stearoyl CoA desaturase is used. This 

DNA has been successfully expressed in tobacco, and 
accordingly is expected to be useful as part of a 
chimeric desaturase /Cyt bj gene of the present invention. 

Genes or cDNAs that encode Cyt bg proteins have 
25 also been identified in a diverse array of eucaryotic 
species, including insects, fungi, mammals and plants. 
Genes or cDNAs from any of these species, as well as DNA 
from any other species identified in the future as 
encoding a Cyt protein, are contemplated for use in the 

3 0 synthetic genes of the present invention. 

In preferred embodiments, Cyt b^-encoding genes 
or cDNAs from plants are used. These DNAs are preferred 
because they naturally comprise the codon usage preferred 
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in plants, so require little, if any, of the modification 
steps described below for non-plant genes. Particularly 
preferred, if available, are Cyt bg-encoding DNAs from the 
same plant species (or group of species) to be 
transformed with the chimeric gene. For instance, 
synthetic chimeric genes constructed for transformation 
of Brassica species might comprise a stearoyl CoA- 
encoding domain from rat and a Cyt bs domain from Brassica 
(see Figs. 1 and 2 for specific sources). This chimeric 
DNA would require optimization for expression in Brassica 
only in the desaturase domain. 

With respect to the naturally-occurring dual 
domain- encoding genes, as well as the chimeric genes 
discussed above, it will be appreciated that the DNA 
molecules can be prepared in a variety of ways, including 
DNA synthesis, cloning, mutagenesis, amplification, 
enzymatic digestion, and similar methods, all available 
in the standard literature. Additionally, certain DNA 
molecules can be obtained by access to public 
repositories, such as the American Type Culture 
Collection. Alternatively, DNA molecules that are not 
readily available, and/or for which sequence information 
is not available, can be isolated from biological sources 
using standard hybridization methods and homologous 
probes that are available. 

B. Optimization for expression in plants 

The second step in designing the synthetic DNA 
molecules of the invention is to customize (i.e. 
optimize) their sequence for expression in the plant 
cytoplasm. This is accomplished by performing one or 
more of the steps listed below on the coding sequence of 
the above described non-plant (or chimeric) 
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desaturase/Cyt b^-encoding DNA molecules. 

1 . From the peptide sequence encoded by the 
DNA, back translate using an appropriate plant codon 
usage table, making certain in particular that the most 
preferred translation termination codon is used. 

2. Visually, or with the aid of computer 
software, analyze the back- translated nucleotide sequence 
for features that could diminish or prevent expression in 
the plant cytoplasm. Such features include: (1) probable 
intron splice sites (characterized by T-rich regions) ; 

(2) plant polyadenylation signals (e.g., AATAAA) ; (3) 
polymerase II termination sequence (e.g., CAN(7.5jAGTNNAA, 
where N is any nucleotide) ; (4) hairpin consensus 
sequences (e.g., UCUUCGG) ; and (5) the sequence- 
destabilizing motif ATTTA (Shah & Kamen, Cell 46: 659- 
667, 1986) . These features have been described in the 
art (U.S. Patent No., 5,500,365 to Fischhoff et al . ; U.S. 
Patent No. 5,380,831 to Adang et al . ) . 

3 . Modify the back-translated sequence in 
light of any "problem" sequences identified in step 2. 
Note that this step may require the introduction of 
codons that are not the most preferred, but instead are 
second or third-most preferred, in order to eliminate the 
more problematic sequences identified in step 2. 

4. Introduce desirable cloning features, such 
as restriction sites, into the sequence in a manner that 
does not materially affect the desired codon usage or 
final polypeptide sequence . 

The aforementioned optimization procedure can 
be performed so that the final polypeptide sequence is 
identical to the initial polypeptide sequence, even 
though the underlying nucleotide sequence has been 
modified. This is a preferred embodiment of the 
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invention. However, it is entirely feasible to modify 
the initial sequence such that the final sequence is not 
identical to the initial sequence, either by virtue of 
amino acid substitutions, insertions or deletions. The 
more that is known about the structure/function 
relationship in a particular desaturase protein, the more 
liberties can be taken in modifying the protein sequence 
during the DNA optimization process. For instance, the 
present inventors have shown that the entire "coiled 
coil" domain of the yeast OLEl gene can be deleted, and 
the protein remains functional. Thus, it appears that 
OLEl can tolerate significant modification in the encoded 
protein without losing its biological activity. 

Codon usage tables for a variety of plants, 
including general plant codon usage tables, tables for 
dicots, tables for monocots, and tables for particular 
species, are widely available. Some of these are 
reproduced in Example 1 below. One good location to 
access such tables is the website: 

http : / /biochem . octago . ac . nz . 8 0 0/Transterm/codons . html . 

In an exemplary embodiment of the present 
invention, the above process is applied to the coding 
sequence of the yeast OLEl gene, which encodes a 
cytoplasmically expressed dual -domain protein comprising 
a A- 9 fatty acid desaturase domain and a Cyt domain. 
Optimization of the OLEl gene for expression in 
Arabidopsis and related species is described in detail in 
Example 1 . 

In another preferred embodiment, the coding 
sequence of the rat stearoyl CoA desaturase is modified 
for expression in plants according to the methods 
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described above. The modified sequence is operably 
linked to a coding sequence for a Cyt domain, 
preferably from a plant, and most preferably from 
Brassica. In this regard, it has been shown that 
expression of this rat desaturase in tobacco produces a 
functional protein that increases the 16:1 fatty acid 
content of plant tissues. Splice site prediction 
analysis of the rat desaturase reveals that there are no 
plant intron-like sequences within the open reading 
frame. However, codon usage analysis reveals that this 
desaturase possesses a number of codons that are not 
optimal for expression in plants, particularly 
Arabidopsis or Brassica. 

In another preferred embodiment, the protein 
coding sequences of the modified vectors described above 
are further modified to increase desaturase activity. 
This is done by altering specific amino acids in the 
encoded protein that control desaturase activity through 
post-translational modifications. These modifications 
are presumed to increase the level of desaturase activity 
in the host plant by stabilizing the desaturase protein 
or by increasing catalytic activity of the desaturase. 
Post translational modifications such as protein 
phosphorylation or dephosphoryiation have been shown to 
alter activity of a number of enzymes by a number of 
different mechanisms. These include increasing or 
decreasing enzyme activity or protein stability, or 
changing the intracellular location of the enzyme. An 
examination of a wide range of A-9 desaturase enzymes 
reveals the existence of a number of highly conserved 
potential phosphorylations sites that could serve as 
sequences that regulate desaturase activity. These are 
shown in bold face on the pile-up diagram in Figure 3 and 
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are summarized in Table 1 of Example 3. The high degree 
of homology between these sites suggests that these 
sequences may also be recognized by host plant 
phosphorylating or dephosphorylating enzymes. If 
phosphorylation of an amino acid within one of the sites 
increases the activity of the desaturase, the nucleic 
acid sequence corresponding to that amino acid can be 
altered to encode a negatively charged amino acid at that 
site to permanently increase the activity of the protein 
in the host. If phosphorylation of an amino acid within 
the site reduces the activity of the desaturase enzyme, 
the nucleic acid sequence can be altered to replace that 
amino acid with a neutral amino acid that will 
permanently increase the activity of the enzyme . 

In another preferred embodiment, elements of 
the genes in the modified vectors described above are 
further modified and improved by the linkage or 
substitution of sequences derived from native plant ER 
lipid faiosynthetic genes. Those sequences contain 
elements that improve the desaturase activity by 
increasing the efficiency of gene expression, 
intracellular protein targeting and/or enzyme stability. 
This is done by identifying elements of the engineered 
desaturase gene that can be replaced or linked with 
elements of a plant gene without significantly affecting 
the desired activity or specificity of the resulting 
enzyme. Genes and cDNAs that encode ER lipid 
biosynthetic enzymes from Brassica, Arabidopsis, 
Nicotiana tabacum. Borage, maize, sunflower and soybeans, 
as well as similar plant genes from any other species 
that are identified in the future, are contemplated for 
use in the synthetic genes of the present invention. 

In connection with the aforementioned 
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embodiment, but not limited thereto, it is particularly 
useful in many cases to pre-test constructs of the 
invention in a yeast expression system, in order to 
eliminate constructs that work poorly before taking the 
more labor- and time -intensive step of testing them in 
plants. Accordingly, this step may be incorporated into 
the methods described herein. 

III. Construction of vectors for transforming plant 
nuclei, and production of transgenic plants 
expressin g synthetic genes of the invention 

The synthetic genes of the present invention 
are intended for use in producing transgenic plants that 
optimally express a dual - function desaturase/Cyt bs 
protein in the cytoplasm of plant cells. Transformation 
of plant nuclei to produce transgenic plants may be 
accomplished according to standard methods known in the 
art. These include, but are not limited to, 
Agrobacteriiim vectors, PEG treatment of protoplasts, 
biolistic DMA delivery, UV laser microbeam, gemini virus 
vectors, calcium phosphate treatment of protoplasts, 
electroporation of isolated protoplasts, agitation of 
cell suspensions with microbeads coated with the 
transforming DNA, direct DNA uptake, liposome-mediated 
DNA uptake, and the like. Such methods have been 
published in the art. See, e.g., Methods for Plant 
Molecular Biology. Weissbach & Weissbach eds . , Academic 
Press, Inc. (1988); Methods in Plant Molecular Biology . 
Schuler & Zielinski, eds.. Academic Press, Inc. (1989); 
Plant Molecula r Biology Manual . Gelvin Schilperoort , 
Verma, eds., Kluwer Academic Publishers, Dordrecht 
(1993); and Methods in Plant Molecular Biology - A 
Laboratory Manual . Maliga, Klessig, Cashmere, Gruissem & 
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Varner, eds., Cold Spring Harbor Press (1994). 

The method of transformation depends upon the 
plant to be transformed. The bioiistic DNA delivery 
method is useful for nuclear transformation, and is a 
preferred method for practice of this invention. In 
another embodiment of the invention, AsrroJbacteriLim 
vectors are used to advantage for efficient 
transformation of plant nuclei. 

In a preferred embodiment, the synthetic gene 
is introduced into plant nuclei in Agrobacterium binary 
vectors. Such vectors include, but are not limited to, 
BIN19 (Bevan, Nucl . Acids Res., 12: 8711-8721, 1984) and 
derivatives thereof, the pBI vector series (Jefferson et 
al., EMBO J., 6: 3901-3907, 1987), and binary vectors 
pGA482 and pGA492 (An, Plant Physiol., 81: 86-91, 1986). 
A new series of Agrobacterium binary vectors, the pPZP 
family, is preferred for practice of the present 
invention. The use of this vector family for plant 
transformation is described by Svab et al . in Methods in 
Plant Molecular Bioloav - A Laboratory Manual . Maliga, 
Klessig, Cashmore, Gruissem and Varner, eds.. Cold Spring 
Harbor Press (1994) . 

Using an Ag-robacterium binary vector system for 
transformation, the synthetic gene of the invention is 
linked to a nuclear drug resistance marker, such as 
kanamycin or gentamycin resistance. Agrobacterium- 
mediated transformation of plant nuclei is accomplished 
according to the following procedure: 

(1) the gene is inserted into the selected 
Agrrobacterium binary vector; 

(2) transformation is accomplished by co- 
cultivation of plant tissue (e.g., leaf discs) with a 
suspension of recombinant Agrobacterium, followed by 
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incubation (e.g., two days) on growth medium in the 
absence of the drug used as the selective medium (see, 
e.g., Horsch et al . , Science 227: 1229-1231, 1985); 

(3) plant tissue is then transferred onto the 
selective medium to identify transformed tissue; and 

(4) identified transf orraants are regenerated 
to intact plants. 

It should be recognized that the amount of 
expression, as well as the tissue specificity of 
expression of the synthetic genes in transformed plants 
can vary depending on the position of their insertion 
into the nuclear genome. Such position effects are well 
known in the art; see Weising et al . , Ann. Rev. Genet,, 
22: 421-477 (1988) . For this reason, several nuclear 
transformants should be regenerated and tested for 
expression of the synthetic gene. 

IV. Uses of the synthetic genes and transgenic 
plants expressing those genes 

The synthetic desaturase genes of the invention 
and transgenic plants expressing those genes can be used 
for several agriculturally beneficial purposes. For 
instance, they can be used in oil-producing crops (e.g., 
corn, soybean, sunflower, rapeseed) to increase the 
overall percentages of monounsaturated fatty acids in 
those oils, thereby improving their health-promoting 
qualities. In this regard, the production of transgenic 
rapeseed plants {Brassica napus) is of particular 
interest in this invention. Example 1 describes a 
synthetic yeast desaturase gene modified for expression 
in Arahidopsis. Because the codon usage of Brassica is 
very similar to that of Arahidopsis, it is expected that 
the synthetic gene described in Example 1 will be as well 
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expressed in Brassica as it is in Arabidopsis . 

Another use for the synthetic genes of the 
invention is to modify the flavors of certain fruit or 
vegetable crops. It has already been shown that 
5 expression of the un-modif ied yeast A- 9 desaturase gene 
in tomato results in alterations in fatty acid 
composition and fatty acid-derived flavor compounds (Wang 
et al., 1996, supra). The synthetic, plant -optimized 
version of this gene is expected to function similarly, 

10 and also to be more efficiently expressed in plant cells. 

Another use for the synthetic genes of the 
invention is to facilitate the formation of omega-5 
anacardic acids, a class of secondary compounds derived 
from the A-9 desaturation of 14:0 in pest -resistant 

15 geraniums (Schultz et al., Proc . Natl. Acad. Sci. USA, 

93 : 877-885, 1996) . It has been shown that formation of 
these compounds proceeds from the expression of A9 
desaturase activity resulting in the formation of A9 
14:1. Subsequent elongation of these molecules leads to 

20 the formation of omega-5 22:1 and 24:1 in the trichome 
exudate that leads to pest resistance against spider 
mites and aphids. 

Another use for the synthetic genes of the 
invention are in the modification of membrane lipid fatty 

25 acyl composition to alter the properties of the 

cytoplasmic and plasma membranes of the cell . These may 
affect functions such membrane associated activities that 
are associated with membrane functions such as signal 
transduction, endocytosis or exocytotic events, entry of 

3 0 fungal or viral pathogens into the cell, and temperature 
or environmentally caused stress that causes physical 
changes in the fluid properties of the plasma membrane or 
internal cell membranes. Plants defective in desaturases 
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have been reported (Somerville and Browse, supra) . These 
mutant plants contain higher than normal levels of 
saturated fatty acids that may lower membrane fluidity 
under normal growing conditions. Thus the effects of 
temperature on these plants involved high temperature 
tolerance as opposed to chilling tolerance. These 
studies yielded interesting information that has 
relevance to temperature stress in general. A mutant of 
Arabidopsis deficient in 16:0 desaturation {Hugly et al, 
Plant Physiol. 90: 1134-1142) for example, has been shown 
to appear and grow normally at non-stressful 
temperatures. Under high temperature conditions, 
however, the mutant performs better than controls in 
growth and biosynthetic studies. Higher temperature 
stability was also noted in pea thylakoids following 
catalytic hydrogenation (Thoman et al . Biochera. Biophys . 
Acta 849 : 131-140, 1986) . 

The following examples are provided to describe 
the invention in greater detail. They are intended to 
illustrate, not to limit, the invention. 

EXAMPLE 1 

Modification of the Saccharomyces cerevisiae OLEl Gene 
for Expres sion in Arabidonsis and Related Species 

When introduced into tobacco and tomato plants, 
the yeast A- 9 desaturase gene {OLEl) was shown to 
desaturate palmitate and stearate, thereby reducing the 
levels of saturated fatty acids in triglycerides 
(Polashock et al . , supra; Wang et al . , supra). However, 
it was unclear whether optimum expression of the OLEl 
gene occurred in those species, and expression in other 
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plant species has been less than optimum. For example, 
the present inventors have found that the level of 
expression of the OLEl gene in tobacco (Polashock, et . 
al.. Plant Physiol. 100:894-901, 1992) and Arabidops is 
varies in different plant tissues and is generally poor 
in tobacco, and Arabidopsis seeds. Similarly, data from 
other investigators indicate that expression of OLEl in 
rapeseed {Brassica napus) seeds is also poor {U.S. Patent 
No. 5,777,201, to Poutre , et al . ) . 

Differential expression of heterologous genes 
in plants can be caused by several factors. It is often 
due to the presence of cryptic intron splicing signals. 
Thus, it is possible that the multiple banding patterns 
observed in northern blots of OiEI-transf ormed tobacco 
(Polashock et al . , supra) are due to splicing of the OLEl 
mRNA. 

In plants, the mRNA splicing mechanism is less 
well defined than in mammalian or yeast systems. There 
is some conservation of the 5 ' and 3 ' splicing signals 
but there is no conserved internal splice signal. 
However, with the accumulation of plant genomic DNA 
sequence data, it is now becoming possible to predict 
with some accuracy where intron splicing will occur 
(Hebsgaard, S.M., P.G. Korning, N. Tolstrup, J. 
Engelbrecht, P. Rouze and S. Brunak, Nucleic Acids 
Research 24 (17) : 3439-3452, 1996). In fact, computer 
programs that predict splice sites have now been 
developed (the "PlantNetGene" server for splice site 
predictions: http://www.cbs.dtu.dk/NetPlantGene.html) . 
From these sources, it appears that plant introns are 
typically identified as T rich sequences. 

Another factor affecting expression of foreign 
genes in plants is codon preference. It is now well 
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known that preference for certain codons exist among 
different phyla, classes, families, genera and species. 
Accordingly, by modifying a DNA sequence so that it uses 
codons preferred in a particular organism, expression of 
that sequence can be optimized. 

Other factors affecting the expression of 
foreign genes in plants include the presence of putative 
polyadenylation signals, hairpin cleavage consensus 
motifs, polymerase II termination sequences and the Shaw- 
Kamen sequence pattern ATTTA. 

This example describes the design and 
construction of ''pl-olel" , a modified Saccharomyces 
cerevisiae OLEl gene optimized for expression in 
Arahidopsis and other plant species. 

The nucleotide sequence of the Saccharomyces 
cerevisiae OLEl gene coding sequence has been described 
in U.S. Patent No. 5,057,419 to Martin et al . 
(incorporated by reference herein) and is set forth below 
for convenience as SEQ ID NO : 1 (open reading frame starts 
at +11) . The 3. cerevisiae A-9 desaturase amino acid 
sequence encoded by OLBl is set forth as SEQ ID NO: 2. 
I . Design o£ yl-olel 

To modify OLEl for optimum expression in 
plants, the OLEl sequence was first analyzed for cryptic 
plant splice signals, using the PlantNetGene server for 
splice site predictions. This analysis identified a 
number of "high confidence" intron splice signals in the 
OLEl sequence. These are shown below (positions 
correspond to position numbers in SEQ ID N0:1) . 
Donor splice site, direct strand ; 

5- - 3- 5, _ 3, 

Position Strand Confidence exon^ intron 

{Start ATG = +1) 
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3 97 + 1.00 GCTCTCTCTG^GTAAAGTACC 

1052 + 0.85 CTATTAAGTG^GTACCAA.TAC 

1074 + 1.00 CCCAACTAAG^GTTATCATCT 

Acceptor splice site, direct strand ; 

5- - 3' 5, _ 3- 

Posxtxon Strand Confidence intron'^eyon 

+ 0.86 GGTCTCACAG^ATCTTACTCC 



Next, the OLEl peptide sequence (SEQ ID NO: 2) 
was back- translated using an Arahidopsis thai i ana codon 
usage table, as shown below. Codon usage in Arabidopsis 
and several other plant species, including Brassica 
napus, Phaseolus vulgaris and Zea mays is very similar, 
as can be seen by a comparison with the respective codon 
usage tables of those species, also shown below (the 
codon usage table of Saccharomyces cerevisiae is shown 
for comparison; codon usage tables taken from 
Ahttp://biochem.otago.ac.nz:800/ Transterm/codons . html ) . 

Arahidopsis tha.llana . 

AmAcid Codon Number /lOOO Fraction 



Gly 


GGG 


6027 


. 00 


10 


.31 


0 


. 14 


Gly 


GGA 


15393 


. 00 


26 


.32 


0 


.37 


Gly 


GGT 


14890 


. 00 


25 


.46 


0 


.35 


Gly 


GGC 


5654 


. 00 


9 


.67 


0 


.13 


Glu 


GAG 


19825 


. 00 


33 


.90 


0 


.51 


Glu 


GAA 


18672 . 


. 00 


31, 


.93 


0 , 


.49 


Asp 


GAT 


20862 . 


. 00 


35. 


.67 


0 . 


.65 


Asp 


GAC 


11061. 


.00 


18. 


. 91 


0. 


.35 


Val 


GTG 


10414 . 


, 00 


17, 


. 81 


0 . 


.26 


Val 


GTA 


5145 . 


, 00 


8 . 


.80 


0. 


.13 


Val 


GTT 


16157 . 


00 


27. 


,63 


0 . 


.41 


Val 


GTC 


8156. 


00 


13 . 


95 


0 . 


.20 


Ala 


GCG 


5361. 


00 


9. 


17 


0. 


13 


Ala 


GCA 


10552 . 


00 


18. 


04 


0. 


25 


Ala 


GCT 


18782 . 


00 


32. 


12 


0. 


45 
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Ala 


GCC 


7249 . 00 


12.4 0 


0.17 


Arg 


AGG 


6684.00 






Arg 


AGA 


10280 . 00 


17.58 


0.34 


Ser 


AGT 


7369.00 


12.60 


0.16 


Ser 


AGC 


6399.00 


10 . 94 


0 . 14 


Lys 


AAG 




34 . 94 


0 . 55 


Lys 


AAA 


16882 . 00 


28 . 87 


0.45 


Asn 


AAT 


11658.00 


19.93 


0.47 


Asn 


AAC 


12987 .00 


22.21 


0.53 


Met 


ATG 


14817 . 00 




1.00 


He 


ATA 


6571 . 00 


11.24 


0.21 


He 


ATT 


13028 . 00 


22 .28 


0 .41 


He 


ATC 


11855.00 


20.27 


0 .38 


Thr 


ACG 


434 6 00 


7.43 


0 . 14 


Thr 


ACA 


8703 . 00 


14.88 


0.28 


Thr 


ACT 


10909 . 00 


18.65 


0.35 


Thr 


ACC 


6720 . 00 


11.49 


0.22 


Trp 


TGG 


Q ^; Q n n 

D £3 D £3 . U U 


11 . 74 


1.00 


End 


TGA 


652.00 


1.11 


0.44 


Cys 


TGT 


5641.00 


9.65 


0.58 


Cys 


TGC 


4154 . 00 


7.10 


0 .42 


End 


TAG 




0 . 43 


0 . 17 


End 


TAA 


591.00 


1.01 


0.40 


Tyr 


TAT 


8052 . 00 


13 .77 


0.47 


Tyr 


TAG 


8965 . 00 


15.33 


0 .53 


Leu 




T 1 '7') "7 n n 
/ .UU 


20.05 


0 . 22 


Leu 


TTA 


6361 . 00 


10 .88 


0. 12 


Phe 


TTT 


11703 . 00 


20.01 


0.47 


Phe 


TTC 


13066 . 00 


22 .34 


0.53 


Ser 




*± OO U . \J U 


8.26 


0 . 10 


Ser 


TCA 


9033 . 00 


15.45 


0.19 


Ser 


TCT 


13022 . 00 


22.27 


0.28 


Ser 


TCC 


6214 . 00 


10.63 


0. 13 


Arg 


CGG 


2531. 00 


4.33 


0.08 


Arg 


CGA 


3142 .00 


5.37 


0 . 10 


Arg 


CGT 


5680 . 00 


9.71 


0.19 


Arg 


CGC 


2100 . 00 


3.59 


0.07 


Gin 


GAG 


9564 . 00 


16.35 


0.47 


Gin 


CAA 


10908 . 00 


18.65 


0.53 
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His 


CAT 


7466 


. 00 


12 


.77 


0 


.58 


His 


CAC 


5415 


. 00 


9 


.26 


0 


.42 


Leu 


CTG 


5669 


. 00 


9 


.69 


0 . 


.11 


Leu 


CTA 


5350 


.00 


9 


. 15 


0 . 


. 10 


Leu 


CTT 


14395 . 


.00 


24, 


. 61 


0 . 


.27 


Leu 


CTC 


9751. 


.00 


16, 


.67 


0 . 


.18 


Pro 


CCG 


4676. 


, 00 


8 . 


. 00 


0 . 


. 17 


Pro 


CCA 


9131. 


. 00 


15. 


.61 


0 . 


33 


Pro 


CCT 


10732. 


.00 


18 . 


.35 


0 . 


39 


Pro 


CCC 


3331. 


,00 


5 . 


,70 


0 . 


12 



Brassxca. napus 

AraAcid Codon Number /lOOO Fraction 



Gly 


GGG 


730 


. GO 


11 


.21 


0 


. 13 


Gly 


GGA 


2042 


.00 


31 


.37 


0 


.36 


Gly 


GGT 


1952 


.00 


29 


.99 


0 


.35 


Gly 


GGC 


892 


. 00 


13 


.70 


0 


. 16 


Glu 


GAG 


2119 


.00 


32 


.55 


0 


.55 


Glu 


GAA 


1764 


.00 


27 


.10 


0 


.45 


Asp 


GAT 


1895 


. 00 


29 


.11 


0 


.56 


Asp 


GAG 


1478 


. 00 


22 


.70 


0 


.44 


Val 


GTG 


1231, 


.00 


18 


.91 


0 


.28 


Val 


GTA 


493 , 


. 00 


7 


.57 


0 , 


. 11 


Val 


GTT 


1624 , 


. 00 


24 


.95 


0 . 


.36 


Val 


GTC 


1124 . 


. 00 


17 


.27 


0. 


.25 


Ala 


GCG 


615. 


.00 


9. 


.45 


0 . 


.13 


Ala 


GCA 


1167 . 


. 00 


17 . 


.93 


0. 


.24 


Ala 


GCT 


2028 . 


. 00 


31. 


.15 


0 . 


,42 


Ala 


GCC 


1056. 


. 00 


16. 


.22 


0 . 


22 


Arg 


AGG 


697. 


, 00 


10 . 


,71 


0. 


22 


Arg 


AGA 


996 . 


,00 


15 . 


,30 


0 . 


32 


Ser 


AGT 


736. 


00 


11 . 


31 


0. 


15 


Ser 


AGC 


803 . 


00 


12 . 


34 


0 . 


17 


Lys 


AAG 


2243 . 


00 


34 . 


46 


0. 


55 


Lys 


AAA 


1817. 


00 


27. 


91 


0 . 


45 


Asn 


AAT 


1058 . 


00 


16. 


25 


0. 


37 


Asn 


AAC 


1811 . 


00 


27. 


82 


0. 


63 


Met 


ATG 


1538 . 


00 


23 . 


63 


1 . 


00 
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lie 


ATA 


669.00 


10.28 


0.20 


lie 


ATT 


1271.00 


19.52 


0.37 


lie 


ATC 


1461 . 00 


22.44 


0.43 


Thr 


ACG 


563 . 00 


8 . 65 




Thr 


ACA 


1059.00 


16 .27 


0.28 


Thr 


ACT 


1154 . 00 


17 . 73 


0.30 


Thr 


ACC 


1073 . 00 


16.48 


0.28 


Trp 


TGG 


798 . 00 


12 . 2 6 




End 


TGA 


69.00 


1.06 


0.37 


Cys 


TGT 


517.00 


7.94 


0.50 


Cys 


TGC 


509.00 


7.82 


0 . 50 


End 


TAG 


33.00 


0.51 




End 


TAA 


83 . GO 


1 .28 


0 . 45 


Tyr 


TAT 


792 . 00 


12 . 17 


0.38 


Tyr 


TAG 


1283.00 


19.71 


0.62 


Leu 


TTG 


1051 . 00 


16.14 


0.20 


Leu 


TTA 


508 . 00 


7.80 


0 . 09 


Phe 


TTT 


1003.00 


15.41 


0.39 


Phe 


TTC 


1562 . 00 


23 . 99 


0 . 61 


Ser 


TCG 


475.00 


7.30 


0 . 10 


Ser 


TCA 


856. 00 


13.15 


0.18 


Ser 


TCT 


1147.00 


17.62 


0 .24 


Ser 


TCC 


799 . 00 


12.27 


0 . 17 


Arg 


CGG 


219.00 






Arg 


CGA 


297.00 


4 . 56 


0 . 09 


Arg 


CGT 


659. 00 


10 . 12 


0.21 


Arg 


CGC 


275.00 


4.22 


0 . 09 


Gin 


CAG 


1188 . 00 






Gin 


CAA 


1168.00 


17.94 


0 . 50 


His 


CAT 


651.00 


10.00 


0 .49 


His 


CAC 


672. 00 


10.32 


0.51 


Leu 


CTG 


592.00 


9. 09 


0.11 


Leu 


CTA 


579 . 00 


8 . 89 


0.11 


Leu 


CTT 


1416. 00 


21.75 


0.26 


Leu 


CTC 


1208 . 00 


18 .56 


0 .23 


Pro 


CCG 


542 . 00 


8.33 


0.15 


Pro 


CCA 


1180 . 00 


18.13 


0.33 


Pro 


CCT 


1281 . 00 


19.68 


0.36 


Pro 


CCC 


527.00 


8 . 10 


0.15 
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Phaseolus vulgaris 



Gly 


GGG 


371.00 


13 .30 


0.15 


Gly 


GGA 


771.00 


27.64 


0.32 


Gly 


GGT 


817. 00 


29.29 


0.34 


Gly 


GGC 


441 . 00 


15 . 81 




Glu 


GAG 


912.00 


32.69 


0 . 54 


Glu 


GAA 


767. 00 


27.50 


0.46 


Asp 


GAT 


776. 00 


27. 82 


0.55 


Asp 


GAG 


625 . 00 


22 . 4 1 




Val 


GTG 


661. 00 


23 .70 


0.36 


Val 


GTA 


174 . 00 


6.24 


0.09 


Val 


GTT 


653.00 


23.41 


0 .36 


Val 


GTC 


346.00 


12 . 4 0 




Ala 


GCG 


180 . 00 


6.45 


0.09 


Ala 


GCA 


528 . 00 


18.93 


0 .26 


Ala 


GCT 


791 . 00 


28.36 


0.39 


Ala 


GCC 


553 . 00 






Arg 


AGG 


324 . 00 


11.61 


0.29 


Arg 


AGA 


325 . 00 


11.65 


0.29 


Ser 


AGT 


317.00 


11.36 


0.14 


Ser 


AGC 






0 . 15 


Lys 


AAG 


1054 . 00 


37.78 


0 . 60 


Lys 


AAA 


697 . 00 


24.99 


0.40 


Asn 


AAT 


555 . 00 


19.90 


0 .42 


Asn 


AAC 


782 . 00 






Met 


ATG 


567 . 00 


20.33 


1.00 


He 


ATA 


274 . 00 


9.82 


0.20 


He 


ATT 


539.00 


19.32 


0.40 


He 


ATC 






0.40 


Thr 


ACG 


166.00 


5.95 


0.11 


Thr 


ACA 


362 . 00 


12 .98 


0 .24 


Thr 


ACT 


480 . 00 


17.21 


0 .32 


Thr 


ACC 


490 . 00 


17.57 


0.33 


Trp 


TGG 


342 . 00 


12.26 


1 . 00 


End 


TGA 


34 . 00 


1.22 


0.44 


Cys 


TGT 


145 . 00 


5.20 


0.39 


Cys 


TGC 


229 . 00 


8.21 


0 . 61 


End 


TAG 


22 . 00 


0.79 


0.28 


End 


TAA 


22 . 00 


0 . 79 


0.28 
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Tyr 


TAT 


400 


. 00 


14 


.34 


0 


.40 


Tyr 


TAG 


597 


. 00 


21 


.40 


0 


.60 


Leu 


TTG 


543 


. 00 


19 


. 47 


0 


. 24 


Leu 


TTA 


184 


. 00 


6 


.60 


0 


. 08 


Phe 


TTT 


458 


. 00 


16 


.42 


0 


.43 


Phe 


TTC 


601 


.00 


21 


.55 


0 


.57 


Ser 


TCG 


149 


. 00 


5 








Ser 


TCA 


416 


.00 


14 


.91 


0 


. 18 


Ser 


TCT 


606 


. 00 


21 


.72 


0 


.26 


Ser 


TCC 


501 


. 00 


17 


.96 


0 


.21 


Arg 


CGG 


71 


. 00 










Arg 


CGA 


76 


. 00 


2, 


.72 


0 


.07 


Arg 


CGT 


169 


.00 


6, 


,06 


0 


. 15 


Arg 


CGC 


158 . 


. 00 


5 . 


,66 


0 


. 14 


Gin 


CAG 










0 . 


.48 


Gin 


CAA 


470 . 


, 00 


16. 


85 


0. 


.52 


His 


CAT 


298. 


,00 


10. 


68 


0 . 


,46 


His 


CAC 


355. 


,00 


12. 


73 


0 . 


.54 


Leu 


CTG 


351. 


00 


12. 


58 


0 . 


15 


Leu 


CTA 


184 . 


00 


6. 


60 


0. 


08 


Leu 


CTT 


569. 


00 


20. 


40 


0 . 


25 


Leu 


CTC 


452 . 


00 


16 . 


20 


0 . 


20 


Pro 


COG 


147. 


00 


5. 


27 


0. 


08 


Pro 


CCA 


694 . 


00 


24. 


88 


0. 


37 


Pro 


CCT 


664 . 


00 


23. 


80 


0 . 


36 


Pro 


ccc 


352. 


00 


12 . 


62 


0. 


19 



Zea mays 

AmAcid Codon Number /lOOO Fraction 



Gly 


GGG 


2466 


.00 


15 


.07 


0 


. 19 


Gly 


GGA 


2186 


. 00 


13 


.36 


0 


.17 


Gly 


GGT 


2607. 


. 00 


15 


.93 


0 , 


.20 


Gly 


GGC 


5499 . 


, 00 


33 . 


.61 


0 . 


.43 


Glu 


GAG 


7364 . 


, 00 


45. 


,01 


0 . 


. 72 


Glu 


GAA 


2823 . 


, 00 


17, 


,25 


0. 


.28 


Asp 


GAT 


3425. 


.00 


20. 


.93 


0. 


,37 


Asp 


GAC 


5740 . 


00 


35. 


,08 


0 . 


. 63 


Val 


GTG 


4365. 


00 


26. 


68 


0. 


38 
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Val 


GTA 


916. 00 


5 . 60 


0 . 08 


Val 


GTT 


2516. 00 


15.38 


0 .22 


Val 


GTC 


3644 . 00 


22 .27 


0.32 


Ala 


GCG 


3698 . 00 


22 . 60 




Ala 


GCA 


2517 . 00 


15 .38 


0 . 16 


Ala 


GCT 


3602 . 00 


22.01 


0.24 


Ala 


GCC 


5481 . 00 


33.50 


0.36 


Arg 


AGG 


2500 . 00 






Arg 


AGA 


1199.00 


7.33 


0 . 13 


Ser 


AGT 


1170 . 00 


7.15 


0.10 


Ser 


AGO 


2776. 00 


16.97 


0.24 


Lys 


AAG 


7241 . 00 




0 . 79 


Lys 


AAA 


1969. 00 


12 . 03 


0.21 


Asn 


AAT 


1946 . 00 


11-89 


0.33 


Asn 


AAC 


3939 . 00 


24 . 07 


0 . 67 


Met 


ATG 


4071 . 00 




1.00 


He 


ATA 


1014 . 00 


6.20 


0.13 


He 


ATT 


2099. 00 


12.83 


0.28 


He 


ATC 


4403 . 00 


26.91 


0.59 


Thr 


ACG 


1890 . 00 


11 . 55 


0 . 22 


Thr 


ACA 


1620 . 00 


9.90 


0. 19 


Thr 


ACT 


1757.00 


10 . 74 


0.21 


Thr 


ACC 


3236. 00 


19.78 


0.38 


Trp 


TGG 


1 994 0 0 


12 . 19 


1.00 


End 


TGA 


199 . 00 


1.22 


0 .45 


Cys 


TGT 


770.00 


4.71 


0.28 


Cys 


TGC 


1963 . 00 


12.00 


0.72 


End 


TAG 






0.28 


End 


TAA 


120.00 


0 . 73 


0.27 


Tyr 


TAT 


1303 . 00 


7.96 


0.27 


Tyr 


TAG 


3440 . 00 


21.02 


0.73 


Leu 


TTG 


1807 . 00 


11.04 


0.13 


Leu 


TTA 


582 .00 


3.56 


0 . 04 


Phe 


TTT 


1697.00 


10.37 


0 .29 


Phe 


TTC 


4082 . 00 


24.95 


0 . 71 


Ser 


TCG 


1620.00 


9.90 


0 . 14 


Ser 


TCA 


1592 . 00 


9.73 


0.14 


Ser 


TCT 


1792 . 00 


10.95 


0.15 


Ser 


TCC 


2746 . 00 


16.78 


0 .23 
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Arg 


CGG 


1505 


. 00 


9 


. 2 0 




Arg 


CGA 


610 


.00 


3 


.73 


0 


Arg 


CGT 


1018 


.00 


6 


.22 


0 


Arg 


CGC 


2562 


. 00 


15 


. 66 


0 


Gin 


CAG 


4280 


. 00 








Gin 


CAA 


1626 


. 00 


9 


.94 


0 . 


His 


CAT 


1378, 


. 00 


8 


.42 


0. 


His 


CAC 


2431. 


.00 


14 , 


.86 


0 . 


Leu 


CTG 


4069 , 


. 00 


24 . 


, 87 


0 . 


Leu 


CTA 


904 . 


.00 


5. 


,52 


0 . 


Leu 


CTT 


2415. 


. 00 


14 . 


. 76 


0. 


Leu 


CTC 


4079. 


00 


24 . 


,93 


0 . 


Pro 


CCG 


2642 . 


00 


16. 


15 


0 . 


Pro 


CCA 


2152. 


00 


13 . 


15 


0 . 


Pro 


CCT 


2102. 


00 


12 . 


85 


0 . 


Pro 


CCC 


2344 . 


00 


14. 


33 


0 . 



Saccharomyces cerevxsiae 

AmAcid Codon Number 



/lOOO 



Gly 


GGG 


18129 


. 00 


6 


.18 


0 


.12 


Gly 


GGA 


32850 


. 00 


11 


.20 


0 


.22 


Gly 


GGT 


66575 


. 00 


22 


.69 


0 


.45 


Gly 


GGC 


28821 


.00 


9 


.82 


0 


.20 


Glu 


GAG 


57100 


. 00 


19 


.46 


0 


.30 


Glu 


GAA 


133513 


. 00 


45 


.51 


0 


.70 


Asp 


GAT 


111120 


.00 


37 


.88 


0 


.65 


Asp 


GAC 


58642 


.00 


19 


.99 


0 


.35 


Val 


GTG 


32144. 


.00 


10, 


.96 


0 


.20 


Val 


GTA 


35470. 


.00 


12, 


.09 


0 , 


.22 


Val 


GTT 


63678 . 


. 00 


21, 


.71 


0 , 


,39 


Val 


GTC 


33136. 


, 00 


11. 


.30 


0 . 


,20 


Ala 


GCG 


18402 . 


00 


6. 


27 


0 . 


11 


Ala 


GCA 


47728 . 


00 


16. 


27 


0 . 


30 


Ala 


GCT 


58916. 


00 


20. 


08 


0 . 


37 


Ala 


GCC 


35917. 


00 


12 . 


24 


0. 


22 


Arg 


AGG 


27990 . 


00 


9. 


54 


0. 


21 


Arg 


AGA 


61524. 


00 


20. 


97 


0. 


47 


Ser 


AGT 


42499 . 


00 


14 . 


49 


0 . 


16 


Ser 


AGC 


29298 . 


00 


9. 


99 


0 . 


11 
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Lys 


AAG 


89539 


. 00 


3 0 


- 52 




Lys 


AAA 


124327 


. 00 


42 


-38 


0 .58 


Asn 


AAT 


106379 


. 00 


36 


.26 


0.60 


Asn 


AAC 


71659 


. 00 


24 


.43 


0.40 


Met 


ATG 


61216 


. 00 








He 


ATA 


53773 


. 00 


18 


.33 


0.28 


He 


ATT 


88869 


. 00 


30 


.29 


0 .46 


He 


ATC 


49422 


. 00 


16 


.85 


0.26 


Thr 


ACG 


24131 








0 . 14 


Thr 


ACA 


52363 


. 00 


17 


. 85 


0.31 


Thr 


ACT 


58260 


. 00 


19 


.86 


0 .34 


Thr 


ACC 


35998 


. 00 


12 


.27 


0.21 


Trp 


TGG 


3 0707 


. 00 






1.00 


End 


TGA 


1901 


.00 


0 


.65 


0 .30 


Cys 


TGT 


23942 


.00 


8 


. 16 


0.62 


Cys 


TGC 


14448 


-00 


4 


- 93 


0.38 


End 








0 


.48 


0.23 


End 


TAA 


2985 . 


.00 


1. 


. 02 


0.47 


Tyr 


TAT 


55441. 


. 00 


18. 


.90 


0.57 


Tyr 


TAG 


42016. 


. 00 


14 . 


.32 


0.43 


Leu 


TTG 










0.28 


Leu 


TTA 


77691. 


. 00 


26. 


.48 


0.28 


Phe 


TTT 


78451. 


, 00 


26. 


, 74 


0.59 


Phe 


TTC 


53809 . 


00 


18. 


34 


0 .41 


Ser 


TCG 








81 


0.10 


Ser 


TCA 


55962 . 


00 


19. 


08 


0.21 


Ser 


TCT 


69019 . 


00 


23 . 


53 


0.26 


Ser 


TCC 


41460 . 


00 


14 . 


13 


0 .16 


Arg 


CGG 










0 . 04 


Arg 


CGA 


9166- 


00 


3 . 


12 


0 . 07 


Arg 


CGT 


18429 . 


00 


6 . 


28 


0 . 14 


Arg 


CGC 


7924 . 


00 


2 . 


70 


0.06 


Gin 


CAG 


36018 . 


00 


12 . 


28 


0.31 


Gin 


CAA 


78385. 


00 


26 . 


72 


0 . 69 


His 


CAT 


40211 . 


00 


13 . 


71 


0 . 64 


His 


CAC 


22609. 


00 


7 . 


71 


0.36 


Leu 


CTG 


31503 . 


00 


10. 


74 


0.11 


Leu 


CTA 


39789 . 


00 


13 . 


56 


0.14 


Leu 


CTT 


36697 . 


00 


12 . 


51 


0.13 


Leu 


CTC 


16401 . 


00 


5. 


59 


0 . 06 
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Pro 
Pro 
Pro 
Pro 



CCG 
CCA 
CCT 
CCC 



15796 . 00 
51725 . 00 
39402 . 00 
20387 . 00 



5.38 
17.63 
13 .43 

6.95 



0 . 12 
0.41 
0.31 
0 . 16 



For each amino acid, the new pl-olel gene was 



designed the codon most preferred in Arabidopsis, with 
the following exceptions: 



switched to CAA. Though the codon preference for 
glutamine is the same for both CAG and CAA in 
Arabidopsis, CAA was used since the AG motif is part of 
the 3' intron splice signal. 

2. In OLEl, there are regions of high 
leucine/valine amino acid usage (e.g., between positions 
322 to 571 of the nucleotide sequence are codons coding 
for 11 leucines and 7 valines) . These regions correspond 
to the OLEl protein transmembrane domains. If the most 
preferred codons in Arabidopsis (CTT and GTT, 
respectively) were used, the region would take on the 
characteristics of a plant intron, i.e., high T content, 
thereby introducing a number of highly probable 5 ' splice 
sites, which could not be removed without altering the 
amino acid sequence. Accordingly, a mixture of 
alternative codons was used for these amino acids. 
Similar changes were also applied to two other regions of 
OLEl (positions 781 to 900 and positions 1081 to 1140) . 



as putative polyadenylation signals, hairpin cleavage 
consensus motifs, ATTTA motifs or concatamers thereof, 
was conducted. Such sequences are described in detail in 
U.S. Patent No. 5,380,831 to Adang et al . (incorporated 
by reference herein) . This search identified one hairpin 
cleavage consensus motif, CTTCGG, at position 553-559 of 



1. 



The codon for glutamine CAG was 



Next, a search for problematic sequences, such 
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SEQ ID N0:1, which was removed by changing TTC to TTT 
(both encoding phenylalanine) . 

Next, a BamHI site and translation initiation 
consensus were added to the 5' end of the OLEl coding 
sequence (M. Kozak, J. Biol. Chem. 266 (30) : 19867-19870, 
1991) . An Xbal and a BamHI site were added to the 3' end 
of the coding sequence. A Pad site was introduced into 
the same position as the original S. cerevisiae OLEl Pad 
site (within the cytochrome domain) , in order to 
provide a convenient restriction site for construction of 
this and other synthetic OLEl genes. Other convenient 
restriction sites, which enable modular construction of 
synthetic OLEl genes, are inherent within the final 
sequence of the new pl-olel gene. 

Finally, the termination codon was checked 
against a stop codon consensus database, "TransTerm" 
(Dalphin et al . , Nucl . Acids Res. 25 (1) : 246-247, 1997). 
The existing termination sequence, TGAT, appeared 
suitable for use in Arabidopsis, and so was not altered. 
II . Construction of d-olel t 

The rebuilt pl-olel nucleotide sequence was 
constructed commercially (Operon Technologies, Inc.). 
The plasmid containing the rebuilt gene was designated 
PAMCM013. The pl-olel nucleotide sequence is set forth 
below as SEQ ID N0:3 (open reading frame starts at +11). 
This sequence encodes SEQ ID N0:2, but differs from the 
S. cerevisiae OLEl gene (SEQ ID N0:1) in the following 
respects (summarized from above) : 

1. Arabidopsis thaliana codon usage; CAG 
switched to CAA for glutamine; 

2. Translation initiation consensus added; 

3 . Hairpin removed; 

4. Several (but not all) PlantNetGene 
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predicted splice sites removed; 

5. Eleven leucines changed from CTT to CTC, 
and 7 valines changed from GTT to GTG in positions 322- 
571, which corresponds to a plant intron-like region; 
similar changes made in regions 781-900 and 1081-1140; 
valine at position 432 retained as GTT to maintain 
Pspl406I site; 

6. Certain leucine and valine codons were 
altered so that the same codons would not appear adjacent 
to others; 

7. Intron acceptor site at position 1047 

altered; 

8. Restriction sites added to allow modular 
construction; PsP1406I site removed at position 1441; and 

9. Pad site introduced at position 13 62; an 
introduced NgoMI site at position 867 removed. 

A gap alignment of SEQ ID NO: 1 (top) and SEQ 
ID NO: 3 (bottom) is shown below: 

Gap alignment of wild type and rebuilt OLEl sequences. 
Percent Similarity: 79.871 Percent Identity: 79.871 

1 TACAACAAAGATGCCAACTTCTGGAACTACTATTGAATTGATTGACGACC 50 

Ml Mill IIIMIMIIIIMIII li I II II II ! 

1 ggatccaacaATGCCTACTTCTGGAACTACTATCGAGCTTATCGATGATC 5 0 
51 AATTTCCAAAGGATGACTCTGCCAGCAGTGGCATTGTCGACGAAGTCGAC 100 

MM M IIIIIMI Mill III li II II II M M 

51 AATTCCCTAAGGATGATTCTGCTTCTTCTGGAATCGTTGATGAGGTTGAT 100 
101 TTAACGGAAGCTAATATTTTGGCTACTGGTTTGAATAAGAAAGCACCAAG 15 0 

I II II MM! II I illlllll I II Mill li II M 

101 CTTACTGAGGCTAACATCCTTGCTACTGGACTTAACAAGAAGGCTCCTAG 150 
151 AATTGTCAACGGTTTTGGTTCTTTAATGGGCTCCAAGGAAATGGTTTCCG 2 00 

Ml II MM! I! M III I Mill M lilil MMIIII i 

151 AATCGTTAACGGATTCGGATCTCTTATGGGATCTAAGGAGATGGTTTCTG 200 
201 TGGAATTCGACAAGAAGGGAAACGAAAAGAAGTCCAATTTGGATCGTCTG 250 

I M Mill MMIIIIIIIMI lllMMi II I III I II 

201 TTGAGTTCGATAAGAAGGGAAACGAGAAGAAGTCTAACCTTGATAGACTT 250 
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251 CTAGAAAAGGACAACCAAGAAAAAGAAGAAGCTAAAACTAAAATTCACAT 3 00 

II 11 INN MINIM M M M MM! MiM M M M 

251 CTTGAGAAGGATAACCAAGAGAAGGAGGAGGCTAAGACTAAGATCCATAT 3 00 
3 01 CTCCGAACAACCATGGACTTTGAATAACTGGCACCAACATTTGAACTGGT 350 

Ml M Mill IMMI I II IIIIMII MUM I IIIMI 

3 01 CTCTGAGCAACCTTGGACTCTcAACAACTGGCATCAACATCTcAACTGGC 3 50 
351 TGAACATGGTTCTTGTTTGTGGTATGCCAATGATTGGTTGGTACTTCGCT 400 

1 MMMM II II Mill Mill MIM II lillMMIMI 

351 TcAACATGGTgCTcGTcTGTGGAATGCCTATGATCGGATGGTACTTCGCT 400 

4 01 CTCTCTGGTAAAGTACCTTTGCATTTAAACGTTTTCCTTTTCTCCGTTTT 450 

MMMM Mill Ml I III I MMMIIMI IMM II II 

4 01 CTcTCTGGAAAaGTgCCTCTcCATCTcAACGTTTTCcTcTTCTCTGTcTT 450 
451 CTACTACGCTGTCGGTGGTGTTTCTATTACTGCCGGTTACCATAGATTAT 500 

_ MIMIIMIII M M M MM! Mill M IIIMIMI i I 

451 CTACTACGCTGTTGGAGGAGTgTCTATCACTGCTGGATACCATAGACTcT 500 
501 GGTCTCACAGATCTTACTCCGCTCACTGGCCATTGAGATTATTCTACGCT 55 0 

_ IMMII MMMIIMI Mill Mill I Ml I IMIIIIII 

501 GGTCTCATAGATCTTACTCTGCTCATTGGCCTCTTAGACTcTTCTACGCT 550 
551 ATCTTCGGTTGTGCTTCCGTTGAAGGGTCCGCTAAATGGTGGGGCCACTC 600 

Mill II MMMM Mill II M Mill MMMM II M 

551 ATCTTtGGATGTGCTTCTGTTGAGGGATCTGCTAAGTGGTGGGGACATTC 600 
601 TCACAGAATTCACCATCGTTACACTGATACCTTGAGAGATCCTTATGACG 650 

_ MMMM M III MIMMIMM IMIIIIIIIM 11 I 

601 TCATAGAATCCATCATAGATACACTGATACTCTTAGAGATCCTTACGATG 650 
651 CTCGTAGAGGTCTATGGTACTCCCACATGGGATGGATGCTTTTGAAGCCA 700 

M I Mill M MMMM II II II M M II M M I I Mill 

651 CTAGAAGAGGACTTTGGTACTCTCATATGGGATGGATGCTTCTTAAGCCT 700 
7 01 AATCCAAAATACAAGGCTAGAGCTGATATTACCGATATGACTGATGATTG 750 

_ M 11 M illMIIIIIIIMIMIII II IIIMIMIMIIIIII 

701 AACCCTAAGTACAAGGCTAGAGCTGATATCACTGATATGACTGATGATTG 750 
751 GACCATTAGATTCCAACACAGACACTACATCTTGTTGATGTTATTAACCG 8 00 

_ Ml M II II I IMMI IMM IMMII II I III i 1 || \ 

751 GACTATCAGATTCCAACATAGACATTACATCtTgCTcATGCTcCTTACTG 8 00 
801 CTTTCGTCATTCCAACTCTTATCTGTGGTTACTTTTTCAACGACTATATG 850 

_ Mlllll II II Mill Mill II I Mill MMMM II III 

801 CTTTCGTgATCCCTACTCTcATCTGTGGATACTTCTTCAACGATTACATG 850 
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851 GGTGGTTTGATCTATGCCGGTTTTATTCGTGTCTTTGTCATTCAACAAGC 900 

II M I lllli M j| M II i I! M mil lIlMlli 

8 51 GGAGGACTcATCTACGCTGGATTCATCAGAGTgTTCGTcATCCAACAAGC 900 
901 TACCTTTTGCATTAACTCCATGGCTCATTACATCGGTACCCAACCATTCG 950 

ni II II II Mill lllllllllllllllll M Mill MM 

9 01 TACTTTCTGTATCAACTCTATGGCTCATTACATCGGAACTCAACCTTTCG 950 
951 ATGACAGAAGAACCCCTCGTGACAACTGGATTACTGCCATTGTTACTTTC 1000 

MM MIMMI III I M llllllll MM! II 111111111 

951 ATGATAGAAGAACTCCTAGAGATAACTGGATCACTGCTATCGTTACTTTC 1000 
10 01 GGTGAAGGTTACCATAACTTCCACCACGAATTCCCAACTGATTACAGAAA 1050 

M M M IIMMIIIIMII II M IIMI IIMllll Mill 

1001 GGAGAGGGATACCATAACTTCCATCATGAGTTCCCTACTGATTAtAGaAA 1050 
1051 CGCTATTAAGTGGTACCAATACGACCCAACTAAGGTTATCATCTATTTGA 1100 

MMM MMIIMMMMIII M Nil! II llllllll Mil 

1051 CGCTATCAAGTGGTACCAATACGATCCTACTAAaGTgATCATCTACtTgA 1100 
1101 CTTCTTTAGTTGGTCTAGCATACGACTTGAAGAAATTCTCTCAAAATGCT 1150 

MMI I M M II II Mill j lllli IIMMIIIII III 

1101 CTTCTCTcGTgGGACTTGCTTACGATCTcAAGAAGTTCTCTCAAAACGCT 1150 
1151 ATTGAAGAAGCCTTGATTCAACAAGAACAAAAGAAGATCAATAAAAAGAA 12 00 

M M M M I M IIIMIM IIIIIIIMIIMI M Mill 

1151 ATCGAGGAGGCTCTTATCCAACAAGAGCAAAAGAAGATCAACAAGAAGAA 1200 

12 01 GGCTAAGATTAACTGGGGTCCAGTTTTGACTGATTTGCCAATGTGGGACA 1250 

IMIIIIIMM Mill M III I IIIIM I n MIMMI I 

1201 GGCTAAGATtAAtTGGGGACCTGTTCTTACTGATCTTCCTATGTGGGATA 1250 
1251 AACAAACCTTCTTGGCTAAGTCTAAGGAAAACAAGGGTTTGGTTATCATT 1300 

I Mill Mi I MIMMIMIMI IIMIIIi I IIIMIM 

1251 AGCAAACTTTCCTTGCTAAGTCTAAGGAGAACAAGGGACTTGTTATCATC 13 00 
1301 TCTGGTATTGTTCACGACGTATCTGGTTATATCTCTGAACATCCAGGTGG 13 50 

Mill M Mill II M IIMI II MIMMI Mili M I! 

13 01 TCTGGAATCGTTCATGATGTTTCTGGATACATCTCTGAGCATCCTGGAGG 1350 
1351 TGAAACTTTAATTAAAACTGCATTAGGTAAGGACGCTACCAAGGCTTTCA 14 0 0 

M MIMMIIM Mill I M IIIM Mill Mlilllll 

13 51 AGAGACT 1 1 aATt AAGACTGCTCTTGGAAAGGATGCTACTAAGGCTTTCT 14 00 

14 01 GTGGTGGTGTCTACCGTCACTCAAATGCCGCTCAAAATGTCTTGGCTGAT 1450 

Ml II II III 1 11 11 11 II MIMMI II I IIIIM 

1401 CTGGAGGAGTTTACAGACATTCTAACGCTGCTCAAAACGTGCTTGCTGAT 1450 
1451 ATGAGAGTGGCTGTTATCAAGGAAAGTAAGAACTCTGCTATTAGAATGGC 15 00 
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Milllll IIMIIIIIIIMI IIIIIIIIMIIIII IIMIIII 

1451 ATGAGAGTTGCTGTTATCAAGGAGTCTAAGAACTCTGCTATCAGAATGGC 1500 
1501 TAGTAAGAGAGGTGAAATCTACGAAACTGGTAAGTTCTTTTAAGCATCAC 1550 

I NMIIIM II llliilll mil IIMIIII Ml I 

1501 TTCTAAGAGAGGAGAGATCTACGAGACTGGAAAGTTCTTCTGAtctagag 1550 
1551 ATTAC 1555 

! I 

1551 gatcc 1555 



The pl-olel synthetic gene contains no intron- 
like regions, or predicted splice sites within its 
sequence. Moreover, comparing the codon usage of 
Arabidopsis with that of Brassica. napus, Phaseolus 
vulgaris or Zea mays, with the exception of cystein (a 
rare amino acid that comprises 1.7% of all Arabidopsis 
codons, and occurs 4 times (0.8%) in OLEl) , the sequence 
contains no rare codons for any of those species . The 
codon usage of pl-olel is particularly similar to the 
preferred usage of Brassica napus. Accordingly, pl-olel 
is expected to be particularly well expressed in all 
those species, and well expressed in any plant species. 

An alternative version of pl-olel, referred to 
herein as pl-olel-2, was also constructed. This 
synthetic gene was modified only in specific codons 
identified as high frequency splicing signals. It was 
discovered that this construct is expressed equally as 
well as pl-olel in Arabidopsis . 



EXAMPLE 2 

Vacuum Infiltration Transformation of 
ArahidoDsis thaliana with pl-olel 

A modification of a transformation protocol of 
Pam Green (http://www.bch.msu.edu/pamgreen/vac.html) was 
used for the transformation of A. thaliana with pl-olel. 
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The protocol was adapted from protocols by Nicole 
Bechtold and Andrew Bent. This protocol gives very good 
results, with 95% of all infiltrated plants giving rise 
to transformants, and a transformant in up to 1 in 2 5 
seeds . 

PROTOCOI. ! 

1. Seeds of Arabidopsis thaliana ecotype 
Columbia were sown in lightweight plastic pots prepared 
in the following way: mound Arabidopsis soil mixture into 
3 to 4 inch pots, saturate soil with Arabidopsis 
fertilizer, add more soil so that it is rounded about 0.5 
above the edge . 

2. Plants were grown under conditions of 16 
hours light / 8 hours dark at 20°C, fertilizing with 
Arabidopsis fertilizer once a week from below, adding 
about 0.5 L to each flat. After 4-6 weeks, plants were 
considered ready for vacuum infiltration when primary 
inflorescence was 10-15 cm tall and the secondary 
inflorescences appeared at the rosette. The bolts were 
clipped back and 2 to 3 days was allowed for them to 
regrow before infiltration. 

3. In the meantime, the construct was 
transformed into Agrobactsrium tumefaciens strain 
(LBA4404) . When plants were ready to transform, a 50 mL 
culture of LB medium containing 50 rag/L kanamycin and 50 
rag/L of streptomycin was inoculated with a 1 mL overnight 
starter culture. 

4. Cultures were grown overnight at 28° C with 
shaking. The culture was pelleted, the supernatant 
removed, and the pellet resuspended in 250 ml of 
infiltration medium to OD600 >0.8. Infiltration medium 
(1 liter) comprised 2.2 g MS salts, 1 X B5 vitamins, 50 g 
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sucrose, 0.5 g MES, pH to 5.7 with KOH, 0.044 =B5M 
benzylaminopurine, 200 =B5L Silwet L-77 (OSl 
Specialties) . 

5. The resuspended culture was placed in a 
5 magenta jar inside a large bell jar. Pots containing 

plants to be infiltrated were inverted into the solution 
so that the entire plant was covered, including rosette, 
but none of the soil was submerged. 

6. A vacuum of 4 00 mm Hg (about 17 inches) was 
10 drawn. Once the vacuum level was reached, the suction was 

closed and the plants allowed to remain under vacuum for 
five minutes. The vacuum was then quickly released. The 
pots were briefly drained, then placed on 
their sides in a tray, which was covered with a humidome 
15 to maintain humidity. The next day, the plants were 

removed to the growth room, the pots uncovered and set 
upright. Plants infiltrated with different constructs 
were kept separated in different trays thereafter. 

7. Plants were allowed to grow under the same 
20 conditions as before. Plants were staked individually as 

the bolts grew. When plants were finished flowering, 
water was gradually reduced, then eliminated to allow the 
plants to dry out . Seeds were harvested from each plant 
individually . 

25 8. Large selection plates were prepared: 4.3 

g/L MS salts; 1 X B5 vitamins (optional) ; 1 % sucrose; 
0.5 g/L MES pH to 5.7 with KOH; 0.8% phytagar - 
Autoclaved, then added antibiotics (35 fxg/mL kanamycin 
and 250 /zg/mL of carbenicillin) and 150 X 15 ram plates 

30 were poured. 

9. Plates were dried well in the sterile hood 
before plating - 20-3 0 minutes with the lids open was 
usually sufficient. 
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10. For each plant, up to 100 /iL of seeds 
(approximately 2500 seeds) was sterilized and plated out 
individually. Seeds were sterilized as follows: 1 min 
in 70% ethanol, 7 minutes in 50% bleach / 0.02 % Triton 

5 X-100 with vortexing, 6 rinses in sterile distilled 
water. Seeds were resuspended in 2 mL sterile 0.1% 
agarose and poured onto large selection plates as if 
plating phage. Plates were tilted so seeds were evenly 
distributed, and allowed to sit 10-15 minutes, during 
10 which time the liquid soaked into the medium. Plates 
were sealed with Parafilm and placed in a growth room. 

11. After 7 to 10 days, transf ormant s were 
visible as dark green plants. These were transferred 
onto "hard selection" plates {100 x 15 mm plates with 

15 same recipe as selection plates but with 1.5 % phytagar) 
to eliminate any pseudo-resistants , then replaced in the 
growth room. 

12. After 10 to 14 days, the plants possessed 
at least two sets of true leaves. At this point, plants 

20 were transferred to soil, covered with plastic, and moved 
to a growth chamber with normal conditions. They were 
typically kept covered for several days . 

References : 

25 Bechtold N, Ellis J, Pelletier G (1998) Methods 

Mol Biol. 82: 259-266. 

Bent A, Kunkel BN, Dahlbeck D, Brown KL, 

Schmidt R, Giraudat J, Leung J, Staskawicz BJ (1994) 

Science 265: 1856-1860. 
30 Koncz C, Schell J (1986) Mol. Gen. Genet. 204 : 

383-396. 

Solutions : 
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lOOOX B5 vitamins (10 mL) : 

1000 mg myo-inositol 
100 mg thiamine -HCl 
10 mg nicotinic acid 
5 10 mg pyridoxine-HCl 

Dissolve in ddH20 and store at -2 0 °C. 



Arabidopsis fertilizer (10 liters) : 

50 mL IM KN03 
10 25 mL IM KP04 (pH 5.5) 

20 mL IM MgS04 

20 mL IM Ca(N03)2 

5 mL O.IM Fe.EDTA 

10 mL micronutrients (see below) 
15 Dissolve in ddH20 and store at room temperature 

Arabidopsis micronutrients (50 0 mL) : 

70 mL 0.5M boric acid 

14 mL 0.5M MnC12 
2 0 2.5 mL IM CuS04 

1 mL 0 . 5M ZnS04 

1 mL 0 . IM NaMo04 

1 mL 5M NaCl 

0.05 mL O.IM CuC12 
25 Dissolve in ddH20 and store at room temperature 



EXAMPLE 3 
Customizing OLEl to Express 
Post-Translational Modi fic at. ions 

30 

After determining the optimized codon 
preferences of OLEl mRNA (or raRNA derived from another 
fungal or animal desaturase) for high level expression in 
the host plant, specific amino acids that are involved in 
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the post-translational control of enzyme activity or 
stability are altered to maximize the catalytic activity 
of the expressed enzyme. There are a number of protein 
kinase and/or phosphorylase consensus sequences that are 
highly conserved in the fungal and animal desaturases . 
These are shown below. First is shown a table of aligned 
potential phosphorylation sites in desaturases . Next is 
shown a pileup of A-9 fatty acid desaturases. PROSITE 
analysis of these desaturases predicts a number of 
potential phosphorylation sites, highlighted by bold 
underlined characters. 
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Pileup of A- 9 fatty acid desaturases showing potential 
phosphorylation sites: 

1 50 

Rat MPAHM LQE.ISSSY. 

Mouse MPAHM LQE.ISSSY. 

Sheep MPAHL LQEEISSSY. 

Pig SSY. 

Human MPAHL LQDDISSSY. 

Hamster MPGHL LQEEMTSSYT 

Drosophila MPP NAQAGAQSIS 

Moth 

C.elegans — MTVKTRSN lAKKIEKDGG 

S.cerevisiae MPTSGTTIEL IDDQFPKDDS ASSGIVDEVD LTEANILATG LNKKAPRIW 

P. angusta 

H. capsulatum 

M. rouxii ~ • 

C . curvatus 

C. merolae MTAKVESKVR EEEKGSNPST 



Rat 
Mouse 
Sheep 
Pig 
Human 
Hamster 
Drosophila 
Moth 
C .elegans 
S . cerevisiae 
P. angusta 
H. capsulatum 
M. rouxii 
C . curvatus 
C. merolae 



51 

TTTTTITEPP 
TTTTTITAPP 
TTTTTITAPP 
TTTTTITAPS 
TTTTTITAPP 
TTTTTITEPP 
DSLIAAASAA 

PETQYLAVDP 
GFGSLMGSKE 
MGTKS 



-MSASTKQAS 
AAADDSGAVI 



SGNLQNGREK 
SG. . .NEREK 
SRVLQNGGGK 
SRVLQNGGGK 
PGVLQNGGDK 

SESLQ 

ADAGQSPTKL 

MPPQG 

NEIIQLQEES 
MVSVEFDKKG 
MTDVTAEEL . 
LNEAPTASPV 

MSN 

TTVAQPSGKP 
PTLKPRPKPA 



MKKVPLYLEE 
VKTVPLHLEE 
LEKTPLYLEE 
SEKTPQYVEE 
LETMPLYLED 

. KTVPLYLEE 
QEDSTGVLFE 
QTGGSWVLYE 
KKWPKCLPA 
NEKKSNLDRL 

. . SKDSVAMM 
AETAAGGKDV 
lATLTSTART 
VTNVIDPERD 
VEPLEREGVE 



DI . 



RPE 
RPE 
RPE 

DI RPE 

DI RPD 

DI RPE 

CD VET 

TD AVN 

RLPTAACKAS 
LEKDNQEKEE 
LAKDRELKNK 
VTDAARRPNS 
KTESMKPPLP 
DFIVPDNYVT 
FDPQRGLVFE 



100 

MREDIHDPSY 
MKEDIHDPTY 
MRDDIYDPNY 
MKDDIYDPTY 
IKDDIYDPTY 
MKEDIYDPSY 
TDGGLVKDIT 
TDTD. .APVI 
QENGECQKIV 
AKTKIH.ISE 
YLKQKH.ISE 
EPKKVH.ITD 
KTKMPP.LFD 
RTVENM . KML 
KTRSSKWMSE 



Rat 
Mouse 
Sheep 
Pig 
Human 
Hamster 
Drosophila 
Moth 
C.elegans 
. cerevisiae 
P. angusta 
capsulatum 
M. rouxii 
curvatus 
C. merolae 



101 

QDEEGPPPKL 
QDEEGPPPKL 
QDKEGPKPKL 
QDKEGPQGKL 
KDKEGPSPKV 
QDEEGPPPKL 
VMKKAEKRLL 
VPPSAEKREW 
FLEIVIPYKM 
QPWTLNNWHQ 
QPWTWENWHR 
TPITLANWHK 
QPVTSKNWTK 
PPVTWRNLHK 
KELNELPLLQ 



EYVWRNIILM 
EYVWRNIILM 
EYVWRNIILM 
EYVWRNIILM 
EYVWRNIILM 
EYVWRNIILM 
KLVWRNIIAF 
KIVWRNVILM 
EIVWRNVALF 
HLNWLNMVLV 
HINWLNFILV 
HISWLNVTLI 
FVNWPQAILL 
NIQWISFLAL 
RINWLS.TSI 



ALLHVGALYG 
VLLHLGGLYG 
GLLHLGALYG 
SLLHLGALYG 
SLLHLGALYG 
ALLHLGALYG 
GYLHLAALYG 
GMLHIGGVYG 
AALHFAAAIG 
CGMPMIGWYF 
LAVPFAG. .L 
lAIPIYG. .L 
CVTPLIALYG 
TIPPAMAIYG 
IFTPLIGT.L 



ITL.IPSSKV 
IIL.VPSCKL 
ITL . IPTCKI 
IIL.IPTCKI 
ITL.IPTCKF 
LVL.VPSSKV 
AYLMVTSAKW 
AYLFLTKAMW 
LYQLIFEAKW 
ALSGKVPLHL 
ISTKWVPLKL 
VQAYWVPLHL 
I FT. .TELTK 
LCT. .VPVQT 
IGIWFVPLQR 



150 

YTLLWGIFYY 
YTALFGIFYY 
YTFLWVLFYY 
YTLLWAFAYY 
YTWLWGVFYY 
YTLLWAFVYY 
QTCILAYFLY 
LTDLFAFFLY 
QTVIFTFLLY 
NVFLFSVFYY 
KTFVTAVILY 
KTALWAWYY 
KTLIWSWIYY 
KTFIWSWYY 
KTLVLAIVTY 
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Rat LISALGITAG TffiRLWSHRTY KARLPLRIFL IIANTMAFQN DVYEWARDHR 

Mouse MTSALGITAG AHRLWSHRTY KARLPLRIFL IIANTMAFQN DVYEWARDHR 

Sheep VISALGITAG VHRLWSHRTY KARLPLRVFL IIANTMAFQN DVFEWSRDHR 

Pig LLSAVGVTAG AHRLWSHRTY KARLPLRVFL IIANTMAFQN DVYEWARDHR 

Human FVSALGITAG AHRLWSHRSY KARLPLRLFL IIANTMAFQN DVYEWARDHR " 

Hamster VISIEGIGAG VHRLWSHRTY KARLPLRIFL IIANTMAFQN DVYEWARDHR 

Drosophila VISGLGITAG AHRLWAHRSY KAKWPLRVIL VIFNTIAFQD AAYHWARDHR 

Moth LCSGLGITAG AHRLWAHKSY KARLPLRLLL TLFNTLAFQD AVIDWARDHR 

C. elegans VFGGFGITAG AHRLWSHKSY KATTPMRIFL MILNNIALQN DVIEWARDHR 

S.cerevisiae AVGGVSITAG YHRLWSHRSY SAHWPLRLFY AIFGCASVEG SAKWWGHSHR 

P. angusta CFGGISITAG YHRHWAHRAY DCKLPVKIFF ALFGASAVEG SIKMWGHQHR 

H. capsulatum FMTGLGITAG YHRLWAHCSY SATLPLKIYL AAVGGGAVEG SIRWWARGHR 

M. rouxii FITGLGITAG YHRMWSHRAY RGTDLLRWFM SFAGAGAVEG SIYWWSRGHR 

C. curvatus FITGLGITAG YHRLWAHRSY NASKPLQYFL ALCGAGSVQG SIRWWSRGHR 

C. merolae FCCGLGITGG YHRLWSHRSY EAHWLVQVIL ACFGAAAFEG SARYWCRLHR 



Rat 
Mouse 
Sheep 
Pig 
Human 
Hamster 
Drosophila 
Moth 
C. elegans 
S .cerevisiae 
P. angusta 
H . capsulatum 
M. rouxii 
C . curvatus 
C. merolae 



201 

AHHKFSETHA 
AHHKFSETHA 
AHHKFSETDA 
AHHKFSETDA 
AHHKFSETHA 
AHHKFSETYA 
VHHKYSETDA 
MHHKYSETDA 
CHHKWTDTDA 
IHHRYTDTLR 
VHHRYTDTPR 
AHHRYTDTDK 
AHHRWTDTDK 
AHHRYTDTKL 
AHHRYVDSDR 



DPHNSRRGFF 
DPHNSRRGFF 
DPHNSRRGFF 
DPHNSRRGFF 
DPHNSRRGFF 
DPHNSRRGFF 
DPHNATRGFF 
DPHNATRGFF 
DPHNTTRGFF 
DPYDARRGLW 
DPYDAKRGFW 
DPYSVRKGLL 
DPYSAHRGFF 
DPYSAHEGFW 
DPYAVEKGFW 



FSHVGWLLVR 
FSHVGWLLVR 
FSHVGWLLVR 
FSHVGWLLVR 
FSHVGWLLVR 
FSHVGWLLVR 
FSHVGWLLCK 
FSHVGWLLVR 
FAHMGWLLVR 
YSHMGWMLLK 
YSHMGWMLLV 
YSHIGWMVMK 
FSHFGWMLVQ 
HAHMGWMLI . 
YAHLWWMVFK 



KHPAVKEKGG 
KHPAVKEKGG 
KHPAVREKGA 
KHPAVKEKGG 
KHPAVKEKGS 
KHPAVKEKGG 
KHPEVKAKGK 
KHPQIKAKGH 
KHPQVKEQGA 
PNP . . . KYKA 
PNP . . . RYKA 
QNP . . . KRIG 
RPK. . .NRIG 
KPR . . . GKIG 
LPR . . . QRQG 



250 

KLDMSDLKAE 
KLDMSDLKAE 
TLDLSDLRAE 
LLNMSDLKAE 
TLDLSDLEAE 
KLDMSDLKAE 
GVDLSDLRAD 
TIDLSDLKSD 
KLDMSDLLSD 
RADITDMTDD 
RADISDLLDD 
RTEITDLNED 
YADVADLKAD 
VADISDLSKN 
RVDITDLNAN 



Rat 
Mouse 
Sheep 
Pig 
Human 
Hamster 
Drosophila 
Moth 
C. elegans 
S.cerevisiae 
P. angusta 
H . capsulatum 
M , rouxi i 
C . curvatus 
C . merolae 



251 

KLVMFQRRYY 
KLVMFQRRYY 
KLVMFQRRYY 
KLVMFQRRYY 
KLVMFQRRYY 
KLVMFQRRYY 
PILMFQKKYY 
PILRFQKKYY 
PVLVFQRKHY 
WTIRFQHRHY 
WWRVQHRHY 
PVWWQHRNY 
HWAFQHKYY 
PWKWQHNNY 
PILRFQHRYY 



KPGLLLMCFI 
KPGLLLMCFI 
KPGVLLLCFI 
KPGILLMCFI 
KPGLLMMCFI 
KPAILLMCFI 
MILMPIACFI 
LTLMPLICFI 
FPLVILCCFI 
ILLMLLTAFV 
LLLMWMAFL 
LKWIFMGIV 
PYFALGMGFI 
VALLFFMGLA 
LQIAILFSFV 



LPTLVPWYCW 
LPTLVPWYCW 
LPTLVPWYLW 
LPTIVPWYCW 
LPTLVPWYFW 
LPTFVPWYFW 
IPTWPMYAW 
LPSYIPT.LW 
LPTIIPVYFW 
IPTLICGYFF 
FPAVLTHYLF 
FPMLVSGLGW 
FPTLVAGLGW 
FPTLVAGLGW 
IPLTISTLGW 



GETFLHSLFV 
GETFVNSLFV 
GESFQNSLFF 
GEAFPQSLFV 
GETFQNSVFV 
GEAFVNSLCV 
GESFMNAWFV 
GESAFNAFFV 
KETAFIAFYT 
ND . YMGGLIY 
ND . FWGGFIY 
GD.WFGGFIY 
GD . FRGGYFY 
GD . WWGGLFF 
GD . FWGGLVY 



300 

STFLRYTLVL 
STFLRYTLVL 
ATFLRYAWL 
ATFLRYAIVL 
ATFLRYAWL 
STFLRYTLVL 
ATMFRWCFIL 
CSIFRYVYVL 
AGTFRYCFTL 
AGFIRVFVIQ 
AGLLRAWIQ 
AGILRIFFVQ 
AGVLRLCFVH 
AGAARLVFVH 
ACLGRMLFVQ 
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Rat 
Mouse 
Sheep 
Pig 
Human 
Hamster 
Drosophila 
Moth 
C. e lagans 
S .cerevisiae 
P. angusta 
H. capsulatum 
M. rouxii 
C . curvatus 
C . merolae 



301 

NATWLVNSAA 
NATWLVNSAA 
NATWLVNSAA 
NATWLVNSAA 
NATWLVNSAA 
NATWLVNSAA 
NVTWLVNSAA 
NVTWLVNSAA 
HATWCINSAA 
QATFCINSLA 
QATFCVNSLA 
QATFCVNSLA 
HATFCVNSLA 
HSTFCVNSLA 
QSTFCVNSLA 



HLYGYRPYDK 
HLYGYRPYDK 
HMYGYRPYDK 
HLYGYRPYDK 
HLFGYRPYDK 
HLYGYRPYDK 
HKFGGRPYDK 
HLWGSKPYDK 
HYFGWKPYDS 
HYIGTQPFDD 
HWIGEQPFDD 
HWLGDQPFDD 
HYLGESTFDD 
HWLGETPFDN 
HWWGEQTFSR 



NIOSRENILV 
NIQSRENILV 
TINPRENILV 
TISPRENILV 
NISPRENILV 
NIDPRENALV 
FINPSENISV 
NINPVETRPV 
SITPVENVFT 
RRTPRDNWIT 
RRTPRDHVLT 
RNSPRDHIVT 
HNTPRDSWVT 
KHTPKDHFIT 
RHTSYDSVIT 



SLGSVGEGFH 
SLGAVGEGFH 
SLGAVGEGFH 
SLGAVGEGFH 
SLGAVGEGFH 
SLGCLGEGFH 
AILAFGEGWH 
SLWLGEGFH 
TIAAVGEGGH 
AIVTFGEGYH 
ALVTFGEGYH 
ALVTLGEGYH 
ALVTMGEGYH 
ALVTVGEGYH 
ALVTLGEGYH 



350 

NYHHAFPYDY 
NYHHTFPFDY 
NYHHTFPYDY 
NYHHTFPYDY 
NYHHSFPYDY 
NYHHAFPYDY 
NYHHVFPWDY 
NYHHTFPWDY 
NFHHTFPQDY 
NFHHEFPTDY 
NFHHEFPSDY 
NFHHEFPSDY 
NFHHQFPQDY 
NFHHQFPMDF 
NFHHEFPHDY 



Rat SASEY.RWHI NFTTFFIDCM AALGLAYDRK KVSKAAVLAR IKRTGDGSHK 

Mouse SASEY.RWHI NFTTFFIDCM AALGLAYDRK KVSKATVLAR IKRTGDGSHK 

Sheep SASEY.RWHI NFTTFFIDCM AAIGLAYDRK KVSKAAVLGR MKRTGEESYK 

Pig SASEY.RWHI NLTTFFIDCM AALGLAYDRK KVSKAAIL 

Human SASEY.RWHI NFNTFFIDWM AALGLTYDRK KVSKAAILAR IKRTGDGNYK 

Hamster SASEY.RWHI NFTTFFIDCM AALGLAYDRK KVSKAAVLAR IKRTGDGSCK 

Drosophila KTAEFGKYSL NFTTAFIDFF AKIGWAYDLK TVSTDIIKKR VKRTGDGTHA 

Moth KTAELGDYSL NFTKMFIDFM ASIGWAYDLK TVSTDVIQKR VKRTGDGSHA 

C. elegans RTSEYS.LKY NWTRVLIDTA AALGLVYDRK TACDEIIGRQ VSNHGCDIQR 

S.cerevisiae RNA.IKWYQY DPTKVIIYLT SLVGLAYDLK KFSQNAIEEA LIQQEQKKIN 

P. angusta RNA.LKWYQY DPTKWIYLL SKVGLAYNLK KFSQNAIDQG ILQQQQKKLD 

H. capsulatum RNA.IEWHQY DPTKWTIWIW KQLGLAYDLK QFRANEIEKG RVQQLQKKID 

M. rouxii RNA.IKFGQY DPTKWKIIVL SWFGLAYELK QFPTNEVTKG RLFMEEKRIQ 

C. curvatus RNA.IKWYQY DPTKWFIWTM AQLGLASHLK KFPDNEIKKG QYTMKLMQLQ 

C. merolae RNG.WWYHW DPTKWVIRLL SWAGLAWHLV RFPRNELVKA RLQVRQEILD 



401 450 

Rat SS* 

Mouse SS 

Sheep SG 

Pig , 

Human SG . ™ . 

Hamster SG — ■ 

Drosophila TWGWGDVDQP KEEIE.DAVI THKKSE II 

Moth VWGWDDHEVH QEDKKLAAII NPEKTE 

C. elegans GKSIM III 

S.cerevisiae KKKAKINWGP VLTDLPMWDK QTFLAKS.KE NKGLVIISGI VHDVSGYISE 

P. angusta RMRAKLNWGP QLSELPVWDK STFFEKA.KE QKGLVIISGI VHDCANFLTE 

H. capsulatum QRRAKLDWGI PLEQLPVIEW DDYVDQA.KN GRGLIAIAGV VHDVTDFIKD 

M. rouxii AQKAKLSYGT PLKDLPIYTW EEYQSLVLND NKKKVLIEGV LYDVEEFMKE 

C. curvatus EQSEKLEWPK HSNDLPVISW EDFQA. .ESK TRALIAVHGF IHDCSSFIED 

C. merolae EAKKRVDWGK PIESLPVWTW KDVQRLAKEE NRLLWIEGI VHDCTRFKVQ 
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451 500 

Rat 

Mouse 

Sheep ^ 

Pig ZIIIIIIIII 

Human 

Hamster , ^ . 

Drosophila 

Moth IIIIIIIII 

elegans ~ 

cerevisiae HPGGETLIKT ALGKDATKAF SGGVYRHSNA AQNVLADMRV AVIKESKNSA 

P. angusta HPGGQALLKT SFGKDATMAF KGGVYAHSKA AHNLLATMRV AVIRDGGANG 

capsulatum HPGGKAMINS GIGKDATAMF NGGVYNHSNA AHNQLSTMRV GVIRGGCEVE 

M. rouxii HPGGMKYLST AVGKDMTTAF NGGIYNHSNG TRNLLTSLRV GVLRNGMQV. 

curvatus HPGGAHLIKR AIGTDSTTAF FGGVYDHSNA AHNLLAMMRV GVLDGGMEVE 

C. merolae HPGGQRILEF WNVRDATQAF NGDVYNHTKA ARNLLAHLRV AQLKEIYEPE 



Protein kinase (specifically cAMP- and 
cGMP- dependent) phosphorylation sites. There have been a 
number of studies relative to the specificity of cAMP- and 
cGMP- dependent protein kinases (Freraisco J.R. et al . , J. 
Biol. Chem. 255:4240-4245, 1980; Glass D.B., Smith S.B., J. 
Biol. Chem. 258:14797-14803, 1983; Glass D.B. et al . , J. 
Biol. Chem. 261:2987-2993, 1986). Both types of kinases 
appear to share a preference for the phosphorylation of 
serine or threonine residues found close to at least two 
consecutive N-terminal basic residues. It is important to 
note that there are quite a number of exceptions to this 
rule. However, the consensus pattern is as follows: 
[RK] (2)-x-[ST], where S or T is the phosphorylation site. 

Protein kinase C phosphorylation site. In vivo, 
protein kinase C exhibits a preference for the 
phosphorylation of serine or threonine residues found close 
to a C-terminal basic residue (Woodget J.R. et al . , Eur. J. 
Biochem. 161:177-184, 1986;. KishimotoA. et al . , J. Biol. 
Chem. 260:12492-12499, 1985). The presence of additional 
basic residues at the N- or C-terminus of the 
target amino acid enhances the Vmax and Km of the 
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phosphorylation reaction. The consensus pattern is: 
[ST] -X- [RK] where S or T is the phosphorylation site. 

Casein kinase II phosphorylation site. Casein 
kinase II (CK-2) is a protein serine/threonine kinase whose 
5 activity is independent of cyclic nucleotides and calcium. 
CK-2 phosphorylates many different proteins. The substrate 
specificity ( Pinna L.A., Biochim. Biophys. Acta 
1054:267-2 84, 1990) of this enzyme can be summarized as 
follows: (1) Under comparable conditions Ser is favored 

10 over Thr; (2) an acidic residue (either Asp or Glu) must be 
present three residues from the C-terminal end of the 
phosphate acceptor site; (3) additional acidic residues in 
positions +1, +2, +4, and +5 increase the phosphorylation 
rate (most physiological substrates have at least one 

15 acidic residue in these positions) ; (4) Asp is preferred to 
Glu as the provider of acidic determinants; and (5) a basic 
residue at the N-terminus of the acceptor site decreases 
the phosphorylation rate, while an acidic one will increase 
it. The consensus pattern is: [ST] -x (2 ) - [DE] where S or T 

20 is the phosphorylation site (note: this pattern is found in 
most of the known physiological substrates) . 

If phosphorylation of a specific site by any 
kinase is found to increase the catalytic activity or 
stability of the encoded desaturase protein, the 

25 phosphorylated serine or threonine residue is changed to 
encode a negatively charged amino acid (aspartic acid or 
glutamic acid) in order to permanently optimize the 
activity or the protein. If phosphorylation of a specific 
residue is found to decrease the activity or stability of 

3 0 the encoded desaturase, the affected serine or threonine 
encoding codon is altered to substitute a neutral or a 
positively charged amino acid that will permanently 
optimize the activity or stability of the protein. 
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EXAMPLE 4 

Further Modifications and Improvements of the 
Saccharomyces cerevlslae OLEl Gene for Plant Expression 
5 Using Elements Derived from Native Plant Desaturase Genes - 

The activity of the native or modified forms of 
the Saccharomyces cerevisiae OLEl A- 9 desaturase gene in 
plant tissues may be further improved by the substitution 

10 or inclusion of elements derived from native plant 

desaturase genes. Favorable plant gene elements may 
include sequences that improve the expression of the 
modified gene at one or more levels, including the 
following: 1) transcription, 2) pre-mRNA processing, 3) 

15 mRNA transport from the nucleus to the cytoplasm, 4) mRNA 
stability 5) translation, 6) targeting or retention of the 
protein at the appropriate membrane surface or organelle 
surface, 7) protein folding and maturation, and 8) 
stability of the functional desaturase protein. 

20 The inventors have shown that the OLEl gene can 

tolerate significant modifications without losing its 
biological activity. These modifications include deletion 
of the "coiled coil" region, the addition of 239 amino 
acids to the N-terminus of OLElp and truncation of 55 and 

25 60 amino acids from the N-terminal end of the protein. The 
inventors have also shown that modifications of the 5' and 
3 'untranslated regions of the OLEl mRNA can significantly 
affect its stability. For example, removing a short open 
reading frame near the 5 ' "cap" region of the OLEl raKMA 

30 increases its half-life in Saccharomyces from 12 minutes to 
approximately 25 minutes. The existence of elements in the 
mRNA that affect its stability indicate that other elements 
might also exist that affect the stability of an mRNA 
generated by a synthetic gene in another host organism. 

35 Plant desaturase gene elements that enhance the 
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f unction of the modified A- 9 desaturase gene are identified 
by a 2 -step method. STEP 1 involves isolating a series of 
DMA sequences from a cDNA that encodes a plant ER lipid 
biosynthetic enzyme. Those elements are linked, or 
inserted into regions of a native or "optimized" gene under 
control of a yeast promoter in a vector suitable for 
expression in Saccharomyces cerevisiae. The resulting 
vectors are then tested for their ability to produce 
functional desaturase enzymes in strains of Saccharomyces 
that contain an inactive form of the A- 9 fatty acid 
desaturase gene . 

In STEP 2, plant desaturase sequences from the 
above vectors that are found to produce a functional A- 9 
desaturase gene are used to a isolate homologous sequences 
from plant genomic DNA. The isolated genomic sequences are 
used to construct a synthetic gene that produces an mRNA 
that encodes the same functional desaturase protein 
produced by the vector in step 1. In this instance, the 
genomic sequences encompass the same protein coding 
elements as those encoded by the homologous cDNA sequence 
and also include genomic elements that encode the 5' and/ 
or 3' untranslated regions of the plant desaturase mRNA. 
These combined genomic elements should differ from the cDNA 
derived sequences used in STEP 1 by containing authentic 
plant introns, {which may facilitate efficient and correct 
splicing of the chimeric mRNA in the plant nucleus) and 
signals that affect the mRNA stability, mRNA transport, and 
efficient translation of the mRNA in plant tissues. The 
chimeric plant / synthetic gene containing the genomic 
sequences is inserted into vectors under the control of 
plant seed- specif ic promoters and tested for expression and 
desaturase function in plants, including Brassica, 
Arabidopsis, maize and soybeans. 
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The following specific examples further 
illustrate these methods employing the Arabidopsis FAD2 
gene, which encodes an ER A12 -desaturase , as a source of 
plant desaturase DNA sequences. In the preferred 
embodiment, the source of the plant desaturase DNA would be 
the FAD2 horaolog, or a related ER lipid biosynthetic gene, 
that is derived from the same plant species that is 
intended to be modified by the resulting vector for 
commercial use. 

A. Substitution of the N- terminal OLEl protein 
coding sequences and with N- terminal sequences from the 
derived from the Arabidopsis FAD2 gene. 

1) A cDNA containing the FAD2 , A- 12 desaturase, 
mRNA coding sequence is isolated by reverse transcriptase - 
polymerase chain reaction (RT-PCR) of isolated mRNAs 
derived from Arabidopsis tissue or by direct DNA synthesis 
using the protein and DNA sequences set forth in SEQ ID 
NO: 4 and SEQ ID NO : 5 (open reading frame starts at +93) . 

2) The inventors have shown that substitution of 
transmembrane sequences of the OLEl gene with transmembrane 
sequences from the Saccharomyces FAH2 gene abolishes A- 9 
desaturase activity. FAH2 encodes a sphingolipid fatty 
acid hydroxylase, which is an ER membrane protein. 
TMPredict analysis of the Arabidopsis FAD2 sequence 
indicates that the first transmembrane region of its 
encoded protein begins at residue +52 and a similar 
analysis of the OLEl sequence indicates that its first 
transmembrane sequence begins at residue +113 . Because the 
inclusion of potential membrane spanning elements from the 
plant desaturase could produce significant changes in the 
desaturase core enzyme structure that affect activity, only 
sequences encoding residues +1 to +52 of FAD2 are tested 
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for functional linkages or substitutions in the 113 residue 
M-terminal region of OLEl . 

A series of PGR oligonucleotide primers are 
synthesized that include a 5 ' primer that complements 
sequences including +1 start codon of the FAD2 gene and 3' 
primers that complement sequences ending, for example, at 
residues +20, +35 and + 52 of the FAD2 gene. These are 
used to amplify a series of fragments of different lengths 
from the FAD2 cDNA that extend from the +1 codon through 
codon +52. A second PGR amplification is performed using a 
5 ' primer that is complementary to sequences that include 
the 5 ' end of the FAD2 mRNA and the 3 ' primer that includes 
codon +52. That amplification is done using Arabidopsis 
genomic DNA as a template. The amplified fragment from 
that reaction is cloned into a bacterial vector and 
subjected to DNA sequencing to detect the presence of 
introns within the genomic sequence. The cloned genomic 
fragment is also used to construct vectors for plant 
expression as indicated in STEP 2 of the method. 

The amplified cDNA fragments is inserted into 
yeast expression vectors that contain the native OLEl mRNA 
coding sequence under the control of the Saccharomyces 
galactose inducible, GALl promoter. Insertion of the plant 
DNA fragments can be done in several ways: 1) A fragment is 
inserted upstream of the OLEl protein coding sequences so 
that its protein coding element is fused in frame to the +1 
codon of the OLEl encoded protein, 2) the codons on the 
plant fragment could replace the equivalent OLEl residues 
starting from the +1 ATG codon (e.g. a plant DNA fragment 
containing codons +1 -> +52 replaces OLEl codons +1 -> 
+52) and 3) the full length fragment containing codons +1 - 
> + 52 of the plant gene is fused in frame to codon +114 of 
the OLEl gene, replacing the OLEl residues +1 -> +113 with 
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plant desaturase residues +1 -> +52. 

The resulting plasmids are transformed into a 
haploid olel/]::LEU2 strain of Saccharomyces . That strain 
contains a null, disrupted form of the OLEl gene and 
therefore has a growth requirement for unsaturated fatty 
acids. The transformed Saccharomyces strains are grown on 
fatty acid depleted galactose medium to test for the 
ability of the induced chimeric gene to support growth of 
the strain without fatty acids. Transformed strains that 
grow on the fatty acid deficient medium are further 
analyzed to assess the effects of the plant sequences on 
desaturase function. This is done by Western blot 
analysis, to measure levels of the resulting desaturase 
protein and by fatty acid analysis of total cellular 
lipids, to assess the relative activity of the desaturase 
enzyme by comparison of the ratio of saturated to 
unsaturated fatty acids. 

3) Using information derived from the above 
tests, a chimeric desaturase gene is constructed using the 
amplified genomic DNA from the FAD2 gene. Construction, 
testing, and analysis these vectors is guided by the 
principle that the most desirable vector is one that 
maximizes the use of the plant gene sequences and minimizes 
the use of the Saccharomyces A- 9 desaturase gene sequences 
while retaining optimal desaturase function. Plant DNA 
fragments derived from the genomic DNA amplification that 
extend from the 5' end of the mRNA sequence to the longest 
sequence that produces optimal desaturase function in yeast 
are inserted into a vector containing the native A- 9 
desaturase gene (or one of its modified forms produced by 
the methods described above) . The fragment is inserted 
into the vector so that the 3' end of its protein coding 
sequence produces an mRNA that generates a protein sequence 
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identical to its counterpart derived from the FAD2 cDNA 
sequences. The resulting chimeric desaturase gene, which 
now encodes an mRNA that includes the FAD2 5 • untranslated 
region in addition to the modified protein coding 
sequences, is placed into a plant expression vector under 
the control of a suitable plant promoter and plant 
termination/ polyadenylation sequences. 

4) The resulting vectors containing the 
plant/yeast chimeric desaturase sequences are transformed 
into plants for testing and analysis of desaturase 
function. Suitable test plants include Arahidopsis 
thaliana, and Brassica. napis. A method for transformation 
and analysis of desaturase gene expression in Arabidopsis 
is provided above. A method for transformation and 
analysis of yeast desaturase expression in Brassica napis 
is described in U.S. Patent No. 5,777,201 to Poutre et al . 
(incorporated by reference herein) . 



B. Insertion or substitution of Arabidopsis FAD2 
C- terminal protein coding sequences and 3' mRNA 
untranslated region sequences into native and modified 
forms of the OLEl gene. 

The inventors have previously shown that proteins 
encoded by the Saccharomyces EL02 and EL03 genes contain a 
series of charged residues in their C- terminal region. 
These proteins are located on the ER surface and function 
in the biosynthesis of very long chain fatty acids as 
described in Oh et al . (j. Biol. Chem. 272: 17376- 
17384, 1997) (incorporated by reference herein). They 
further showed that deletion of the region containing the 
charged residues causes the proteins to be mislocalized 
from their normal cellular locations in the endoplasmic 
reticulum, resulting in reduced function. Similar clusters 
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of charged residues occurs in the C-terminal region of the 
OLEl gene that are apparently associated with ER retention 
or localization. These residues do not appear to be a part 
of the functional cytochrome domain. A detailed 
5 comparison of the C-terminal OLEl and the Arabidopsis FAD2 
sequences show that the plant desaturase has similar, but 
not identical , clusters of charged residues to those in the 
OLEl gene. These sequences are shown below: 

10 SEP ID Nos ; 6 and 7 : 

Comparison of the charged carboxyl terminal amino acids of 
Olelp (SEQ ID NO: 7) and the Arabidopsis Fad2p desaturase 
(SEQ ID NO: 6) (The region of the OLEl gene shown does not 
appear to be a functional part of its cytochrome bg 
15 domain) . 

+- + - - -++ + 

A. thaliana FAD2 WYVAMYREAK ECIYVEPDRE GDKKGVYWYN NKL* 

20 +-++++-- + 

S.cerevisiae OLEl MRVAVIKESK NSAIRMASKR GEIYETGKFF * 



Methods similar to those shown in Section A can be used to 
25 identify Arabidopsis FAD2 sequences that can replace the 
OLEl C-terminal sequences to optimize gene expression, 
membrane targeting and ER retention of the chimeric enzyme. 

1) A series of oligonucleotide primers for PGR 
amplification are synthesized for isolation of elements in 
3 0 the C-terminal region of the FAD2 gene. A FAD2 DNA 

fragment encompassing that region is generated by PGR 
amplification of the cDNA clone. Alternatively, given the 
smaller size of the fragment it or modified forms of the 
plant fragment may be generated directly by DNA synthesis. 
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A fragment containing that region and its flanking 3 • 
untranslated region also is generated by PGR amplification 
of Arabidopsis genomic DMA as described above. That 
fragment is cloned into an appropriate vector and sequenced, 
as also described. 

2) Vectors are constructed that contain the plant 
DNA fragments linked to or substituted into the OLEl C- 
terminal coding region as described in Section A. In this 
instance, the plant DNA fragments are linked in frame to 
the carboxyl terminal residues of the OLEl protein coding 
region. 

3) The resulting vectors are transformed into the 
Saccharorayces olelzl strain and tested for desaturase 
function as described in Section A. 

4) Using information derived from the above 
tests, chimeric desaturase genes containing the C-terminal 
plant sequences that produce functional desaturases are 
constructed using the amplified genomic DNA from the FAD2 
gene, according to the principles outlined in Section A. 
The resulting sequences are employed to construct vectors 
that will express the chimeric plant/yeast gene under 
control of plant promoter and plant termination/ 
polyadenylation sequences. Those vectors are transformed 
into plants for testing and analysis of desaturase function 
as described above . 

The present invention is not limited to the 
embodiments described and exemplified above, but is capable 
of variation and modification without departure from the 
scope of the appended claims . 
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We claim: 

1. A synthetic fatty acid desaturase gene for 
expression in a multicellular plant, the gene comprising a 
desaturase domain and a cyt domain, wherein the gene is 
customized for expression in a plant cytoplasm. 

2. The synthetic gene of claim 1, customized 
from a naturally occurring gene encoding a cytosolic A- 9 
desaturase . 

3. The synthetic gene of claim 2, customized 
from a naturally occurring gene from Saccharomyces 
cerevisiae . 

4. The synthetic gene of claim 3, customized 
from a naturally occurring gene from Saccharomyces 
cerevisiae that encodes SEQ ID NO: 2. 

5. The synthetic gene of claim 4, customized 
from a naturally occurring gene from Saccharomyces 
cerevisiae comprising SEQ ID NO:l. 

6. The synthetic gene of claim 3, comprising SEQ 

ID NO : 3 . 

7. The synthetic gene of claim 1, which further 
comprises an expression regulatory sequence from a plant 
gene encoding an ER biosynthetic pathway enzyme. 

8. The synthetic gene of claim 1, customized for 
expression in a monocotyledonous plant. 
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9. The synthetic gene of claim 1, customized for 
expression in a dicotyledonous plant. 



10. The synthetic gene of claim 1, customized 
5 for expression in a plant genus selected from the group 
consisting of Araibidopsis, Brassica, Phaeseolus , Oryza, 
Olea, Elaeis (Oil Palm) and Zea. 



11. The synthetic gene of claim 1, customized 
10 from a naturally occurring gene comprising both a 
desaturase domain and a cyt bg domain. 



12. The synthetic gene of claim 1, wherein the 
gene is a chimeric gene comprising a desaturase domain and 

15 a heterologous cyt he, domain. 

13. The synthetic gene of claim 1, customized 
from a naturally occurring gene such that the synthetic 
gene and the naturally occurring gene encode an identical 

2 0 amino acid sequence. 



14. The synthetic gene of claim 13, wherein the 
synthetic gene and the naturally occurring gene encode SEQ 
ID N0:2 . 

25 

15. The synthetic gene of claim 1, customized 
from a naturally occurring gene such that the synthetic 
gene and the naturally occurring gene encode a similar 
amino acid sequence . 

30 

16. The synthetic gene of claim 1, customized 
from a naturally occurring gene such that the synthetic 
gene and the naturally occurring gene encode a similar 
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amino acid sequence, and the synthetic gene possesses 
improved stability or catalytic activity as compared with 
the naturally occurring gene. 

5 17. A method for constructing a customized 

bifunctional desaturase/cyt bg encoding gene for expression 
in the cytosol of a multicellular plant, comprising the 
steps of: 

(a) providing a DNA molecule comprising a 
10 desaturase-encoding moiety operably linked to a cyt bg- 

encoding moiety, said DNA molecule producing the 
bifunctional polypeptide in a non- custom! zed form; 

(b) back-translating the polypeptide 
sequence using preferred codons for expression in a 

15 multicellular plant, thereby producing a back-translated 
nucleotide sequence; 

(c) analyzing the back- translated nucleotide 
sequence for features that could diminish or prevent 
expression in the plant cytoplasm; 

20 (d) modifying the analyzed sequence to 

correct or remove the features that could diminish or 
prevent expression in the plant cytoplasm; and 

(e) optionally, introducing pre-determined 
cloning features into the sequence in a manner that does 

25 not materially affect the codon usage or final polypeptide 
sequence, thereby producing the customized bifunctional 
desaturase/cyt bs encoding gene for expression in the 
cytosol of a multicellular plant. 

30 18. The method of claim 17, wherein the features 

that could diminish or prevent expression in the plant 
cytoplasm include one or more features selected from the 
group consisting of: putative intron splice sites, plant 
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polyadenylation signals, RNA polymerase II termination 
sequences, and hairpin consensus sequences. 

19. The method of claim 17, which further 
comprises the step of: 

(f) testing the customized bifunctional 
desaturase/cyt encoding gene for desaturase function in 
fatty acid deficient strains of a microorganism prior 
introducing the gene into vectors for expression in plants. 

20. The method of claim 19, wherein the 
microorganism is Saccharomyces cerevisiae. 

21. The method of claim 17, which further 
comprises incorporating into the customized gene one or 
more genomic segments from plant desaturase or other ER 
lipid biosynthetic genes, which comprise beneficial 
elements to further optimize expression of the genes in 
plants, comprising the steps of: 

a) selecting a cDNA sequence that 
potentially comprises one or more of the beneficial 
elements ; 

b) creating a yeast vector expressing a 
desaturase gene modified to contain one or more of the 
beneficial elements; 

c) testing the vector in a yeast expression 

system; 

d) isolating regions from genomic DNA that 
are homologous to the beneficial elements from the cDNA; 
and 

e) operably linking the genomic DNA regions 
to the customized bifunctional desaturase/cyt b^ encoding 
gene to produce the further customized gene. 
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ACKNOWLEDGMENT OF REVIEW OF PAPERS AND DUTY OF CAPTOR 

I hereby state that I have reviewed and understand the contents of the above-identified specification, 
including the claims, as amended by any amendment referred to above. 

I acknowledge the duty to disclose information, which is material to patentability as defined in 37, 
Code of Federal Regulations, Section 1 .56. 

PRIORITY CLAIM (35 U.S.C. Section 1 19(a)-(d)) 

I hereby claim foreign priority benefits under Titie 35, United States Code, Section 119(a)-(d) of 
any foreign application(s) for patent or inventor's certificate or of any PCT international application(s) 
designating at least one country other than the United States of America listed below and have also 
identified below any foreign application(s) for patent or inventor's certificate or any PCT international 
application(s) designating at least one country other than the United States of America filed by me on the 
same subject matter having a filing date before that of the application(s) of which priority is claimed. 



Such appHcations have been filed as follows. 



PRIOR PCX APPLICATION(S) FILED WITHIN 12 MONTHS 
(6 MONTHS FOR DESIGN) PRIOR TO THIS APPLICATION 
AND ANY PRIORITY CLAIMS UNDER 35 U.S.C. SECTION 119(a)-(d) 



INDICATE 
IFPCT 


APPLICATION NUMBER 


DATE QF FILING 
DAY, MONTH, YEAR 


PRIORITY 
CLAIMED 
UNDER 35 U.S.C. 
SECTION 119 


PCT 


US99/19443 


24 August 1999 


yes 



CLAIM FOR BENEFIT OF PRIOR U.S. PROVISIONAL APPLICATION(S) 

(35 U.S.C. Section 119(e)) 

I hereby claim the benefit under Title 35, United States Code, Section 1 19(e) of any United States 
provisional application(s) listed below: 

PROVISIONAL APPLICATION NUMBER FILING DATE 

60/097,586 August 24, 1998 



POWER OF ATTORNEY 

I hereby appoint the following practitioner(s) to prosecute this application and transact all business 
in the Patent and Trademark Office connected therewith. 

APPOINTED PRACTITIONER(S) REGISTRATION NUMBER(S) 

Janet E. Reed 36,252 

Alfred W. Zaher 42,248 

Teresa O. Bittenbender 47,425 

Bruce D. George 43,63 1 



I hereby appoint the practitioner(s) associated with the Customer Number provided below to 
prosecute this application and to transact all business in the Patent and Trademark Office connected 
therewith. 



SEND CORRESPONDENCE TO 



DIRECT TELEPHONE CALLS TO: 



Janet E.'Reed 
215-972-8386 



Janet E . Reed 
_T500Ma^tStiset,J8^^ 

Centre Square West 
"Philadelphia, PA 19102 



DECLARATION 

I hereby declare that all statements made herein of my own knowledge are true and that all 
statements made on information and belief are believed to be true; and further that these statements were 
made with the knowledge that willful false statements and the like so made are punishable by fine or 
imprisonment, or both, under Section 1001 of Title 18 of the United States Code, and that such willful false 
statements may jeopardize the vahdity of the application or any patent issued thereon. 



SIGNATURE(S) 

Date C>'-\~0\ Country of Citizenship US 

Residence Somers eLNJ 

Post Office Address 3 Seely's Run, Somerset, NJ 08873 



Charles E, M artin 
Inventor's signature 



Andrew M itchell (j \ P /^f 

Inventor's signature ^^^'^^^ / {yi~>^yL'\lJ\ 

Date 3 ' 1 - 0^ Country of Citizenship UK , _ 

Residence Highland P ark, N J z'vJ -J 

Post Office Address . 106 North Third Stree t, Highland Park, NJ 08904 |/ 



This replies to the Notice of Missing Requirements under 35 U.S.C. section 371 and 37 C.F.R. 
section 1.494 (FORM PCT/DO/EO/905). 

A copy of FORM PCT/DO/EO/905 accompanies this response. 



DECLARATION OR OATH 

I. No original declaration or oath was filed. Enclosed is the original declaration or oath for this 
application. 



FEES 

n. 

2. Surcharge fees 

Surcharge set forth in Section 1 .492(e), for accepting the 
declaration later than 20 months after the priority date in 
filing an application in the U.S. as a 

designated office $ 65.00 



Total fees $65.00 



SMALL ENTITY STATUS 
in. Applicant claims small entity status under 37 C.F.R. 1 .27. 



EXTENSION OF TIME 

IV. The proceedings herein are for a patent application. The provisions of 37 C.F.R. section 1.1 36(a) 
apply. 



Applicant believes that no extension of term is required. However, this conditional peition is being 
made to provide for the possibility that applicant has inadvertently overlooked the need for a petition and fee 
for extension of time. 



" ' TOTAL FEE DUE 

V. The total fee due is: 

Completion fee(s) $ 65 .00 

Extension fee (if any) $ 00.00 



(Completion of Filing Requirements for International Application Entering National Stage in Designated Office (DO/US) under 

35 U.S.C. section 371-page 2 of 3)604877 1 5/21/01 



TOTAL FEE DUE $65.00 

PAYMENT OF FEES 
VI. Enclosed is a check in the amount of $ 65 .00. 

AUTHORIZATION TO CHARGE ADDITIONAL FEES 

The Commissioner is hereby authorized to charge the following additional fees that may be required 
by this paper to Account No. 50-1089. 
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let E. Reed 
Registration No. 36,252 
Saul Ewing LLP 
Centre Square West, 38th Floor 
1500 Market Street 
Philadelphia, PA 19102 
215-972-8386 
Customer No. 28134 



(Completion of Filing Requirements for International Application Entering National Stage in Designated Office (DO/US) under 

35 U.S.C, sedion 371--page 3 of 3)604877 1 5/21/01 



